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Methods and Compositions for Controlling 
Cell Death and Disease Resistance in Plants 

Field of the Invention 
The invention relates to the genetic manipulation of 
plants, particularly to novel genes and proteins and their 
uses in regulating cell death and disease resistance in 
5 plants. 

Background of the Invention 
A host of cellular processes enable plants to defend 
themselves from disease caused by pathogenic agents. These 
processes apparently form an integrated set of resistance 

10 mechanisms that is activated by initial infection and then 
limits further spread of the invading pathogenic 
microorganism . 

Subsequent to recognition of a potentially pathogenic 
microbe, plants can activate an array of biochemical 

15 responses. Generally, the plant responds by inducing 

several local responses in the cells immediately surrounding 
the infection site. The most common resistance response 
observed in both nonhost and race- specif ic interactions is 
termed the "hypersensitive response" (HR) . In the 

20 hypersensitive response, cells contacted by the pathogen, 
and often neighboring cells, rapidly collapse and dry in a 
necrotic fleck. Other responses include the deposition of 
callose, the physical thickening of cell walls by 
lignif ication, and the synthesis of various antibiotic small 

25 molecules and proteins. Genetic factors in both the host 
and the pathogen determine the specificity of these local 
responses which can be very effective in limiting the spread 
of infection. 

Many environmental and genetic factors cause general 
30 leaf necrosis in maize and other plants. In addition, 
numerous recessive and dominant genes have been reported 
which cause discreet necrotic lesions to form. These lesion 
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mutants mimic disease lesions caused by various pathogenic 
organisms of maize. For example, Lesl, a temperature- 
sensitive conditional lethal mutant, mimics the appearance 
of Helminthosporium maydis on susceptible maize. 
5 Many genes causing necrotic lesions have been reported. 

The pattern of lesion spread on leaves is a function of two 
factors: lesion initiation and individual lesion 
enlargement . 

The lethal leaf spot-1 (llsl) mutation of maize is 

10 inherited in a recessive monogenic fashion and is 

characterized by the formation of scattered, necrotic leaf 
spots (lesions) that expand continuously to engulf the 
entire tissue. Since llsl spots show striking resemblance 
to lesions incited by race 1 of Cochliobolus 

15 (Helminthosporium) carbonum on susceptible maize, this 

mutation has been grouped among the class of genetic defects 
in maize called "disease lesion mimics." 

Lesion mimic mutations of maize have been shown to be 
specified by more than forty independent loci. These lesion 

20 mimic plants produce discreet disease -like symptoms in the 
absence of any invading pathogens. It is intriguing that 
more than two thirds of these mutations display a partially 
dominant, gain-of -function inheritance, making it the 
largest class of dominant mutants in maize, and suggesting 

25 the involvement of a signalling pathway in the induction of 
lesions in these mutations. Similar mutations have also 
been discovered in other plants including arabidopsis and 
barley. 

Despite the availability of the large number of lesion 
3 0 mimic mutations in plants, the mechanistic basis and 

significance of this phenomenon, and the wild-type function 
of the genes involved, has remained elusive. The 
understanding of the molecular and cellular events that are 
responsible for plant disease resistance remains 
35 rudimentary. This is especially true of the events 

controlling the earliest steps of active plant defense, 
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recognition of a potential pathogen and transfer of the 
cognitive signal throughout the cell and surrounding tissue. 

Diseases are particularly destructive processes 
resulting from specific causes and characterized by specific 
5 symptoms. Generally the symptoms can be related to a 

specific cause, usually a pathogenic organism. In plants, a 
variety of pathogenic organisms cause a wide variety of 
disease symptoms. Because of the lack of understanding of ' 
the plant defense system, methods are needed to protect 
10 plants against pathogen attack. 

Summary of the Invention 
Compositions and methods for suppressing cell death and 
controlling disease resistance in plants are provided. The 
compositions, cell death suppressing proteins and the genes 

15 encoding such proteins, are useful for activating disease 
resistance, enhancing plant cell transformation efficiency, 
engineering herbicide resistance, genetically targeting cell 
ablations, and other methods involving the regulation of 
cell death and disease resistance in plants. 

20 Additionally, novel promoter sequences are provided for 

the expression of genes in plants. 

Brief Description of the Drawings 
Figure 1 sets forth the organization of the 3kb EcoRI 
restriction fragment containing lis sequence. 
25 Figure 2 shows that a single transcript was detected 

when mRNA from mature leaves was probed with the llsl 
transcript . 

Figure 3 shows the preferred sites for possible 
modification of the protein to alter protein activity (SEQ 
30 ID NOS 2 & 5-61, respectively) . 

Detailed Description of the Invention 
The invention is drawn to compositions and methods for 
controlling cell death and disease resistance in plant 
cells. The compositions are proteins, ring-hydroxylating 
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dioxygenases, which act to control cell death and regulate 
disease resistance in plants. The proteins and genes 
encoding them can be used to regulate cell death and disease 
resistance in transformed plant cells as well as a variety 
5 of other uses. The proteins are useful in resistance to 
pathogens and survival of the cells particularly after 
pathogen attack. 

One aspect of the invention is drawn to proteins which 
are involved in the degradation of plant phenolics, cell 

10 death- suppressing and disease resistance proteins. Such 
proteins are characterized by containing two consensus 
motifs, a Rieske-type iron-sulfur binding site, and a 
mononuclear iron-binding site, and function as aromatic 
ring -hydroxyla ting (ARH) dioxygenases . The Rieske motif 

15 contains two cysteine and histidine residues responsible for 
binding an iron atom cof actor. Plant proteins containing at 
least one of the motifs have been identified and can be used 
in the methods of the present invention.. Alternatively, 
proteins from bacteria with the Rieske motif are known in 

20 the art and can be used in the methods of the invention. 
Bacterial proteins of particular interest are ring- 
hydroxylating dioxygenases, particularly those from the 
cyanobacterium Synechocystis . See, for example, Gibson et 
al. (1984) Microbial degradation of organic compounds, 181- 

25 252. D.T. Gibson, ed. (New York: Marcel Dekker) , pp. 181- 
252. 

The cell death- suppressing and disease resistance 
proteins of the invention encompass a novel class of plant 
proteins. The amino acid sequence of the Usl protein 

30 isolated from maize is set forth in SEQ ID NOS 1 & 2, 
respectively. However, the proteins are conserved in 
plants. Thus, as discussed below, methods are available for 
the identification and isolation of genes and proteins from 
any plant. Likewise, sequence similarities can be used to 

35 identify and isolate other bacterial genes and proteins. 
The proteins function to inhibit the spread of cell death 
and control disease resistance in plants. Therefore, the 
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proteins are useful in a variety of settings involving the 
regulation of cell death and control of disease resistance 
in plants. 

The Rieske motif exhibited by the proteins of the 
5 invention is shared by a class of enzymes known as ring- 

hydroxylating dioxygenases . The motif contains two cysteine 
and histidine residues responsible for binding an iron atom 
cofactor - residues that are shared by other proteins termed 
Rieske iron-sulfur proteins. The bacterial genes included 

10 in the proteins of the invention are known as catabolic 

operons. Thus, it is predicted that the plant proteins are 
related to the degradation of phenolic compound (s). In 
fact, a para-coumaric ester accumulates in llsl lesioned 
plants, but not in normal -type siblings or wild- type 

15 siblings inoculated with the fungus Cochliobolus 

heggerostrophua . While the present invention is not 
dependent upon any particular mechanism of action, it is 
believed that the cell death- suppressing function of the 
novel protein may be mediated by the detoxification of a 

20 phenolic compound whose cell damaging effects are fueled by 
light harvested by photosynthetically- functional pigments in 
the leaf . 

Modifications of such proteins are also encompassed by 
the present invention. Such modifications include 
25 substitution of amino acid residues, deletions, additions, 
and the like. For example, the protein can be mutagenized 
in such a way that its activity is reduced, but not 
completely abolished. See, for example, Jiang et al . 
(1996), J. Bacterial, 178:3133-3139, where the Tyr-221 from 
30 the mononucleate iron binding site of toluene dioxygenase 
was changed to Ala. This change resulted in a reduction in 
activity to 42% of the normal activity. A change of Tyr-266 
to Ala reduced the activity to 12%. In the same manner, 
amino acid changes, particularly changes from Tyr to Ala, of 
3 5 the sequence of the proteins of the present invention can 
lead to increases or decreases in activity. Figure 3 sets 
forth potential modifications which may alter expression of 



5 



WO 98/39422 



PCT7US98/04040 



the resulting protein. See also SEQ ID NOS 2 & 5-61, 
respectively. Such modifications can result in dominant 
negative inhibitors of the wild type protein. Using these 
sequences, the expression of llsl can be regulated such that 
5 disease resistance can be obtained in the absence of 
lesions . 

After each modification of the protein, the resulting 
protein will be tested for activity. To test for activity, 
plants can be transformed with the DNA sequence and tested 

10 for their response to a fungal pathogen. Of particular 

interest are changes that result in a reduction of activity. 
Such changes will confer disease resistance, yet not result 
in the lesion phenotype. These modified proteins, and the 
corresponding genes, will be useful in disease defense 

15 mechanisms in plants. 

Accordingly, the proteins of the invention include 
naturally occurring plant and bacterial proteins and 
modifications thereof. Such proteins find use in preventing 
cell death and controlling disease resistance. The proteins 

20 are also useful in protecting plants against pathogens. In 
this manner, the plant is transformed with a nucleotide 
sequence encoding the protein. The expression of the 
protein in the plant prevents cell death and confers 
resistance to infection by plant pathogens. 

25 The nucleotide sequences encoding the novel proteins 

are also provided. The llsl gene from maize encodes the 
novel maize protein which inhibits the spread of cell death 
from wounding or internal stresses that occur during 
photosynthesis. The maize gene can be utilized to isolate 

30 homologous genes from other plants, including Arabidopsis, 
sorghum, Brassica, wheat, tobacco, cotton, tomato, barley, 
sunflower, cucumber, alfalfa, soybeans, sorghum, etc. 

Methods are readily available in the art for the 
hybridization of nucleic acid sequences. Coding sequences 

35 from other plants may be isolated according to well known 
techniques based on their sequence homology to the maize 
coding sequences set forth herein. In these techniques all 
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or part of the known coding sequence is used as a probe 
which selectively hybridizes to other cell death- suppressor 
coding sequences present in a population of cloned genomic 
DNA fragments or cDNA fragments (i.e. genomic or cDNA 
5 libraries) from a chosen organism. 

For example, the entire llsl sequence or portions 
thereof may be used as probes capable of specifically 
hybridizing to corresponding coding sequences and messenger 
RNAs. To achieve specific hybridization under a variety of 

10 conditions, such probes include sequences that are unique 
among llsl coding sequences and are preferably at least 
about 10 nucleotides in length, and most preferably at least 
about 2 0 nucleotides in length. Such probes may be used to 
amplify llsl coding sequences from a chosen organism by the 

15 well -know process of polymerase chain reaction (PCR) . This 
technique may be used to isolate additional llsl coding 
sequences from a desired organism or as a diagnostic assay 
to determine the presence of llsl coding sequences in an 
organism. 

20 Such techniques include hybridization screening of 

plated DNA libraries (either plaques or colonies; see, e.g. . 
Sambrook et al . , Molecular Cloning, eds., Cold Spring Harbor 
Laboratory Press (1989)) and amplification by PCR using 
oligonucleotide primers corresponding to sequence domains 

25 conserved among the amino acid sequences (see, e.g. Innis et 
al., PCR Protocols, a Guide to Methods and Applications, 
eds. , Academic Press (1990) ) . 

For example, hybridization of such sequences may be 
carried out under conditions of reduced stringency, medium 

30 stringency or even stringent conditions (e.g., conditions 

represented by a wash stringency of 35-40% Formamide with 5x 
Denhardt's solution, 0.5% SDS and lx SSPE at 37° C; 
conditions represented by a wash stringency of 40-45% 
Formamide with 5x Denhardt's solution, 0.5% SDS, and lx SSPE 

35 at 42° C; and conditions represented by a wash stringency of 
50% Formamide with 5x Denhardt's solution, 0.5% SDS and lx 
SSPE at 42° C, respectively) , to DNA encoding the cell death 
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suppressor genes disclosed herein in a standard 
hybridization assay. See J . Sambrook et al . , Molecular 
Cloning, A Laboratory Manual 2d Ed. (1989) Cold Spring 
Harbor Laboratory. In general, sequences which code for a 
5 cell death suppressor and disease resistance protein and 

hybridize to the maize llsl gene disclosed herein will be at 
least 50% homologous, 70% homologous, and even 85% 
homologous or more with the maize sequence. That is, the 
sequence similarity of sequences may range, sharing at least 
10 about 50%, about 70%, and even about 85% sequence 
similarity. 

Generally, since leader peptides are not highly 
conserved between monocots and dicots, sequences can be 
utilized from the carboxyterminal end of the protein as 
15 probes for the isolation of corresponding sequences from any 
plant. Nucleotide probes can be constructed and utilized in 
hybridization experiments as discussed above. In this 
manner, even gene sequences which are divergent in the 
aminoterminal region can be identified and isolated for use 
20 in the methods of the invention. 

Also provided are mutant forms of the llsl gene (the 
cell death suppressor and disease resistance gene) and the 
proteins they encode. Methods for mutagenesis and 
nucleotide sequence alterations are well known in the art. 
25 See, for example, Kunkel, T. (1985) Proc. Natl. Acad. Sci. 
USA 82:488-492; Kunkel et al . (1987) Methods in Enzymol. 
154:367-382; US Patent No. 4,873,192; Walker and Gaastra 
(eds.) Techniques in Molecular Biology, MacMillan Publishing 
Company, NY (1983) and the references cited therein. Thus, 
3 0 the genes and nucleotide sequences of the invention include 
both the naturally occurring sequences as well as mutant 
forms. Likewise, the proteins of the invention encompass 
both naturally occurring proteins as well as variations and 
modified forms thereof. 
35 The nucleotide sequences encoding the proteins or 

polypeptides of the invention are useful in the genetic 
manipulation of plants. In this manner, the genes of the 
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invention are provided in expression cassettes for 
expression in the plant of interest. The cassette will 
include 5' and 3' regulatory sequences operably linked to 
the gene of interest. The cassette may additionally contain 
5 at least one additional gene to be cotrans formed into the 
organism. Alternatively, the gene(s) of interest can be 
provided on another expression cassette. Where appropriate, 
the gene(s) may be optimized for increased expression in the 
transformed plant. Where bacterial ring-hydroxylating 

10 dioxygenases are used in the invention, they can be 
synthesized using plant preferred codons for improved 
expression. Methods are available in the art for 
synthesizing plant preferred genes. See, for example, U.S. 
Patent Nos. 5,380,831, 5,436, 391, and Murray et al . (1989) 

15 Nucleic Acids Res. 17:477-498, herein incorporated by 
reference . 

The expression cassettes may additionally contain 5' 
leader sequences in the expression cassette construct . Such 
leader sequences can act to enhance translation. 
20 Translation leaders are known in the art and include: 
picornavirus leaders, for example, EMCV leader 
(Encephalomyocarditis 5' noncoding region) (Elroy-Stein, O., 
Fuerst, T.R., and Moss, B. (1989) PNAS USA, 86:6126-6130); 
potyvirus leaders, for example, TEV leader (Tobacco Etch 
25 Virus) (Allison et al . (1986); MDMV leader (Maize Dwarf 

Mosaic Virus); Virology, 154:9-20) , and human immunoglobulin 
heavy-chain binding protein (BiP) , (Macejak, D.G., and P. 
Sarnow (1991) Nature, 353:90-94; untranslated leader from 
the coat protein mRNA of alfalfa mosaic virus (AMV RNA 4) , 
30 (Jobling, S.A., and Gehrke, L. , (1987) Nature, 325:622-625; 
tobacco mosaic virus leader (TMV) , (Gallie, D.R. et al. 
(1989) Molecular Biology of RNA, pages 237-256; and maize 
chlorotic mottle virus leader (MCMV) (Lommel, S.A. et al . 
(1991) Virology, 81:382-385). See also, Della-Cioppa et al . 
35 (1987) Plant Physiology, 84:965-968. Other methods known to 
enhance translation can also be utilized, for example, 
introns, and the like. 
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In preparing the expression cassette, the various DNA 
fragments may be manipulated, so as to provide for the DNA 
sequences in the proper orientation and, as appropriate, in 
the proper reading frame. Towards this end, adapters or 
5 linkers may be employed to join the DNA fragments or other 
manipulations may be involved to provide for convenient 
restriction sites, removal of superfluous DNA, removal of 
restriction sites, or the like. For this purpose, in vitro 
mutagenesis, primer repair, restriction, annealing, 
10 resection, ligation, PCR, or the like may be employed, where 
insertions, deletions or substitutions, e.g. transitions and 
transversions, may be involved. 

The compositions and methods of the present invention 
can be used to transform any plant. In this manner, 
15 genetically modified plants, plant cells, plant tissue, 
seed, and the like can be obtained. Transformation 
protocols may vary depending on the type of plant or plant 
cell, i.e. monocot or dicot, targeted for transformation. 
Suitable methods of transforming plant cells include 
20 microinjection (Crossway et al . (1986) Biotechniques 

4:320-334} , electroporation (Riggs et al. (1986) Proc. Natl. 
Acad. Sci. USA, 83:56 02-5606, Agrobacterium mediated 
transformation (Hinchee et al . (1988) Biotechnology, 
6:915-921), direct gene transfer (Paszkowski et al . (1984) 
25 EMBO J., 3:2717-2722) , and ballistic particle acceleration 
(see, for example, Sanford et al., U.S. Patent 4,945,050; 
and McCabe et al. (1988) Biotechnology, 6: 923-926) . Also 
see, Weissinger et al. (1988) Annual Rev. Genet., 
22:421-477 ; Sanford et al . (1987) Particulate Science and 
30 Technology, 5:27-37 (onion); Christou et al . (1988) Plant 
Physiol. 87:671-674 (soybean) ; McCabe et al . (1988) 
Bio/Technology, 6:923-926 (soybean); Datta et al. (1990) 
Biotechnology, 8: 736-740 (rice) ; Klein et al . (1988) Proc. 
Natl. Acad. Sci. USA, 85:4305-4309 (maize) ; Klein et al. 
35 (1988) Biotechnology, 6:559-563 (maize); Klein et al . (1988) 
Plant Physiol., 91 : 440-444 (maize) ; Fromm et al . (1990) 
Biotechnology, 8:833-839; and Tomes et al. "Direct DNA 

10 
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transfer into intact plant cells via microprojectile 
bombardment " In: Gamborg and Phillips (Eds.) Plant Cell, 
Tissue and Organ Culture: Fundamental Methods, Springer- 
Verlag, Berlin (1995) ; Hooydaas-Van Slogteren & Hooykaas 
5 (1984) Nature (London), 311:763-764; Bytebier et al . (1987) 
Proc. Natl. Acad. Sci. USA, 84:5345-5349 (Liliaceae) ; De Wet 
et al. (1985) In The Experimental Manipulation of Ovule 
Tissues, ed. G.P. Chapman et al . , pp. 197-209. Longman, NY 
(pollen); Kaeppler et al . (1990) Plant Cell Reports, 5:415- 
10 418; and Kaeppler et al. (1992) Theor. Appl. Genet., 54:560- 
566 (whisker-mediated transformation); D'Halluin et al. 

(1992) Plant Cell, 4:1495-1505 (electroporation) ; Li et al . 

(1993) Plant Cell Reports, 12:250-255 and Christou and Ford 
(1995) Annals of Botany, 75:407-413 (rice); Osjoda et al . 

15 (1996) Nature Biotechnology, 14:745-750 (maize via 
Agrobacterium tumefaciens) ; all of which are herein 
incorporated by reference. 

The cells which have been transformed may be grown into 
plants in accordance with conventional ways. See, for 

20 example, McCormick et al . (1986) Plant Cell Reports, 5:81- 
84. These plants may then be grown, and either pollinated 
with the same transformed strain or different strains, and 
the resulting hybrid having the desired phenotypic 
characteristic identified. Two or more generations may be 

25 grown to ensure that the subject phenotypic characteristic 
is stably maintained and inherited and then seeds harvested 
to ensure the desired phenotype or other property has been 
achieved. 

As noted earlier, the nucleotide sequences of the 
30 invention can be utilized to protect plants from disease, 
particularly those caused by plant pathogens. Pathogens of 
the invention include, but are not limited to, viruses or 
viroids, bacteria, insects, fungi, and the like. Viruses 
include tobacco or cucumber mosaic virus, ringspot virus, 
35 necrosis virus, maize dwarf mosaic virus, etc. Specific 
fungal pathogens for the major crops include: Soybeans : 
Phytophthora megasperma fsp. glycinea, Macrophomina 
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phaseolina, Rhizoctonia solani, Sclerotinia sclerotiorum, 
Fusarium oxysporum, Diaporthe phaseolorum var. sojae 
(Phomopsis sojae), Diaporthe phaseolorum var. caulivora, 
Sclerotium rolfsii, Cercospora kikuchii , Cercospora sojina, 
Peronospora manshurica, Colletotrichum dewatium 
( Col letoti chum truncation) , Corynespor'a cassiicola, Septoria 
glycines, Phyllosticta sojicola, Alternaria alternata, 
Pseudomonas syringae p.v. glycinea, Xanthomonas campestris 
p.v. phaseoli, Microsphaera diffusa, Fusarium semi tectum, 
Phialophora gregata, Soybean mosaic virus, Glomerella 
glycines , Tobacco Ring spot virus, Tobacco Streak virus, 
Phakopsora pachyrhizi , Pythium aphanidermatum, Pythium 
ultimum, Pythium debaryanum, Tomato spotted wilt virus, 
Heterodera glycines Fusarium solani; Canola ; Albugo 
Candida, Alternaria brassicae, Leptosphaeria maculans, 
Rhizoctonia solani, Sclerotinia sclerotiorum, Mycosphaerella 
brassiccola, Pythium ultimum, Peronospora parasitica, 
Fusarium roseum, Alternaria alternata; Alfalfa : davibater 
michiganese subsp. insidiosum, Pythium ultimum, Pythium 
irregulare, Pythium splendens, Pythium debaryanum, Pythium 
aphanidermatum, Phytophthora megasperma, Peronospora 
trifoliorum, Phoma medicaginis var. medicaginis, Cercospora 
medicaginis, Pseudopeziza medicaginis, Leptotrochila 
medicaginis, Fusarium oxysporum, Rhizoctonia solani, 
Uromyces striatus, Colletotrichum trifolii race 1 and race 
2, Leptosphaerulina briosiana, Stemphylium botryosum, 
Stagonospora meliloti, Sclerotinia trifoliorum, Alfalfa 
Mosaic Virus, Verticillium albo-atrum, Xanthomonas 
campestris p. v. alfalfae, Aphanomyces euteiches, Stemphylium 
herbarum, Stemphylium alfalfae; Wheat : Pseudomonas syringae 
p.v. atrofaciens, Urocystis agropyri, Xanthomonas campestris 
p.v. translucens, Pseudomonas syringae p.v. syringae, 
Alternaria alternata, Cladosporium herbarum, Fusarium 
graminearum, Fusarium avenaceum, Fusarium culmorum, Ustilago 
tritici, Ascochyta tritici, Cephalosporium gramineum, 
Collotetrichum graminicola, Erysiphe graminis f .sp. tritici, 
Puccinia graminis f.sp. tritici, Puccinia recondita f .sp. 
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tritici, Puccinia striiformis, Pyrenophora tritici -repentis , 
Septoria nodorum, Septoria tritici, Septoria avenae, 
Pseudocercosporella herpotrichoides , Rhizoctonia solani, 
Rhizoctonia cerealis , Gaeumannomyces graminis var. tritici, 
5 Pythium aphani derma turn, Pythium arrhenomanes , Pythium 

ultimum, Bipolaris sorokiniana, Barley Yellow Dwarf Virus, 
Brome Mosaic Virus, Soil Borne Wheat Mosaic Virus, Wheat 
Streak Mosaic Virus, Wheat Spindle Streak Virus, American 
Wheat Striate Virus, Claviceps purpurea, Tilletia tritici, 

10 Tilletia laevis, Ustilago tritici, Tilletia indica, 
Rhizoctonia solani, Pythium arrhenomannes , Pythium 
gramicola, Pythium aphani derma turn, High Plains Virus, 
European wheat striate virus; Sunflower : Plasmophora 
halstedii, Sclerotinia sclerotiorum, Aster Yellows, Septoria 

15 helianthi, Phomopsis helianthi, Alternaria helianthi, 

Alternaria zinniae, Botrytis cinerea, Phoma macdonaldii, 
Macrophomina phaseolina, Erysiphe cichoracearum, Rhizopus 
oryzae, Rhizopus arrhizus, Rhizopus stolonifer, Puccinia 
helianthi, Verticillium dahliae, Erwinia carotovorum pv. 

20 carotovora, Cephalosporium acremonium, Phytophthora 

cryptogea, Albugo tragopogonis; Corn : Fusarium monili forme 
var. subglutinans , Erwinia stewartii, Fusarium monili forme, 
Gibberella zeae (Fusarium graminearum) , Stenocarpella maydi 
(Diplodia maydis) , Pythium irregulare, Pythium debaryanum, 

25 Pythium graminicola, Pythium splendens, Pythium ultimum, 

Pythium aphani derma turn, Aspergillus flavus, Bipolaris maydis 
O, T (Cochliobolus heterostrophus) , Helminthosporium 
carbonum I, II & III (Cochliobolus carbonum) , Exserohilum 
turcicum I, II & III, Helminthosporium pedicellatum, 

30 Physoderma maydis, Phyllosticta maydis, Kabatiella zeae, 
Colletotrichum graminicola, Cercospora zeae-maydis, 
Cercospora sorghi, Ustilago maydis, Puccinia sorghi, 
Puccinia polysora, Macrophomina phaseolina, Penicillium 
oxalicum, Nigrospora oryzae, Cladosporium herbarum, 

35 Curvularia lunata, Curvularia inaequalis, Curvularia 

pallescens, Clavibacter michiganense subsp. nebraskense, 
Trichoderma viride, Maize Dwarf Mosaic Virus A & B, Wheat 
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Streak Mosaic Virus, Maize Chlorotic Dwarf Virus, Claviceps 
sorghi, Pseudonomas avenae, Erwinia chrysanthemi pv. zea, 
Erwinia corotovora, Cornstunt spiroplasma, Diplodia 
macrospora, Sclerophthora macrospora, Peronosclerospora 
5 sorghi, Peronosclerospora philippinensis , Peronosclerospora 
maydis, Peronosclerospora sacchari, Spacelotheca reiliana, 
Physopella zeae, Cephalosporium maydis, Caphalosporium 
acremonium, Maize Chlorotic Mottle Virus, High Plains Virus, 
Maize Mosaic Virus, Maize Rayado Fino Virus, Maize Streak 
10 Virus, Maize Stripe Virus, Maize Rough Dwarf Virus; Sorghum : 
Exserohilum turcicum, Colletotrichum graminicola (Glomerella 
graminicola) , Cercospora sorghi, Gloeocercospora sorghi, 
Ascochyta sorghina, Pseudomonas syringae p. v. syringrae, 
Xanthomonas campestris p. v. holcicola, Pseudomonas 
15 andropogonis , Puccinia purpurea, Macrophomina phaseolina, 
Perconia circinata, Fusarium monili forme, Alternaria 
alternate, Bipolaris sorghicola, Helminthosporium 
sorghicola, Curvularia lunata r Phoma insidiosa, Pseudomonas 
avenae (Pseudomonas alboprecipitans) , Ramulispora sorghi, 
20 Ramulispora sorghicola, Phyllachara sacchari, Sporisorium 
reilianum (Sphacelotheca reiliana) , Sphacelotheca cruenta, 
Sporisorium sorghi, Sugarcane mosaic H, Maize Dwarf Mosaic 
Virus A & B, Claviceps sorghi, Rhizoctonia solani, 
Acremonium s trie turn, Sclerophthona macrospora, 
25 Peronosclerospora sorghi, Peronosclerospora philippinensis, 
Sclerospora graminicola, Fusarium graminearum, Fusarium 
oxysporum, Pythium arrhenomanes, Pythium graminicola, etc. 

The nucleotide sequences also find use in enhancing 
transformation efficiency by suppressing cell death in 
30 bombarded cells. Thus, the sequences find particular use in 
transformation methods in which programmed cell death 
occurs. The physical wounding of particle bombardment 
triggers programmed cell death. The expression of the cell 
death- suppressor gene in a bombarded cell serves to inhibit 
35 such cell death thereby improving transformation efficiency. 
By "improving efficiency" is intended that the number of 
transformed plants recovered by a transformation event is 
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increased. Generally, the number of transformed plants 
recovered is increased at least two- fold, preferably at 
least five- fold, more preferably at least ten-fold. 

For use in improving transformation efficiency, a cell 
5 death suppressor gene is included in an expression cassette. 
Typically, the gene will be used in combination with a 
marker gene. Other genes of interest may additionally be 
included. The respective genes may be contained in a single 
expression cassette, or alternatively in separate cassettes. 

10 Methods for construction of the cassettes and transformation 
methods have been described above . 

As noted, the cell death suppressor gene can be used in 
combination with a marker gene. Selectable marker genes and 
reporter genes are known in the art. See generally, G. T. 

15 Yarranton (1992) Curr. Opin. Biotech., 3:506-511; 

Christopherson et al . (1992) Proc. Natl. Acad. Sci. USA, 
89:6314-6318; Yao et al . (1992) Cell, 71:63-72; W. S. 
Reznikoff (1992) Mol . Microbiol., 6: 2419-2422; Barkley et 
al. (1980) The Operon, pp. 177-220; Hu et al . (1987) Cell, 

20 48:555-566; Brown et al . (1987) Cell, 49:603-612; Figge et 
al. (1988) Cell, 52:713-722; and, Deuschle et al. (1989) 
Proc. Natl. Acad. Aci. USA, 8^:5400-5404. 

Plant tissue cultures and recombinant plant cells 
containing the proteins and nucleotide sequences, or the 

25 purified protein, of the invention may also be used in an 
assay to screen chemicals whose targets have not been 
identified to determine if they inhibit llsl protein. Such 
an assay is useful as a general screen to identify chemicals 
which inhibit llsl protein activity and which are therefore 

30 herbicide candidates. Alternatively, recombinantly-produced 
llsl protein may be used to elucidate the complex structure 
of the enzyme. Such information regarding the structure of 
the llsl protein may be used, for example, in the rational 
design of new inhibitory herbicides. It is recognized that 

35 both plant and bacterial nucleotide sequences may be 

utilized. The inhibitory effect on the cell -suppressor 
protein may be determined in an assay by monitoring the rate 
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of cell death or alternatively by monitoring the 
accumulation of the activating phenolic compound, 
particularly the para-coumaric ester associated with lesion 
mutants. 

5 If such a chemical is found, it would be useful as a 

herbicide, particularly if plant or bacterial mutant genes 
can be isolated or constructed which are not inhibited by 
the chemical. As indicated above, molecular techniques are 
available in the art for the mutagenesis and alteration of 

10 nucleotide sequences. Those sequences of interest can be 
selected based on resistance to the chemical. Where 
resistant forms of llsl or a . corresponding gene have been 
identified to a chemical, the chemical is also useful as a 
selection agent in transformation experiments. in these 

15 instances, the mutant llsl would be used as the selectable 
marker gene. 

The sequences of the invention also find use to 
genetically target cell ablations. In this manner, dominant 
negative nucleotide sequences can be utilized for cell 

20 ablation by expressing such negative sequences with specific 
tissue promoters. See Figure 3 and SEQ ID NOS 2 & 5-61, 
respectively. For example, stamen promoters can be utilized 
to drive the negative alleles to achieve male sterile 
plants. (See, for example, EPA034402 9 and U.S. Patent No. 

25 5,470,359, herein incorporated by reference). 

Alternatively, cell ablation can be obtained by disrupting 
dominant negative oligonucleotides with a transposable 
insertion. In this manner, very specific or general 
patterns of cell ablations can be created. Additionally, to 

3 0 provide specific cell ablation, antisense oligonucleotides 
for llsl or other genes of the invention can be expressed in 
target cells disrupting the translation which produces the 
cell death suppressor proteins. 

As discussed, the genes of the invention can be 

35 manipulated to enhance disease resistance in plants. In 

this manner, the expression or activity of the llsl or other 
cell death suppressor or disease resistance gene can be 
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altered. Such means for alteration of the gene include co- 
suppression, antisense, mutagenesis, alteration of the sub- 
cellular localization of the protein, etc. In some 
instances, it may be beneficial to express the gene from an 
5 inducible promoter, particularly from a pathogen inducible 
promoter. Such promoters include those from pathogenesis- 
related proteins (PR proteins) which are induced following 
infection by a pathogen; e.g., PR proteins, SAR proteins, 
beta-1, 3-glucanase, chitinase, etc. See, for example, 

10 Redolfi et al. (1983) Neth. J. Plant Pathol. 09:245-254; 

Uknes et al. (1992) The Plant Cell 4:645-656; and Van Loon 
(1985) Plant Mol . Virol. 4:111-116. 

A promoter which is capable of driving the expression 
of genes in a plant cell is additionally provided. The 

15 promoter is inducible. Thus, it may be manipulated to 

express heterologous resistance mechanisms at the site of 
pathogen infection. Accordingly, the promoter is useful for 
driving any gene in a plant cell, particularly genes which 
are needed at the site of infection or wounding. That is, 

20 the promoter is particularly useful for driving the 

expression of disease or insect resistance genes. The 
nucleotide sequence of the promoter is provided in SEQ ID 
NO: 3. 

It is recognized that the nucleotide sequence of the 
25 promoter may be manipulated yet still retain the functional 
activity. Such methods for manipulation include those 
discussed above. Thus, the invention encompasses those 
modified promoter sequences, as well as promoter elements 
retaining the functional activity of the promoter. Such 
30 elements and modified sequences can be assayed for activity 
by determining the expression of a reporter gene operably 
linked to the promoter element or modified promoter 
sequence. 

A genomic DNA sequence comprising the lis gene and 
35 promoter are provided in SEQ ID NO: 4. The sequence can be 
used to construct probes to determine the location and 
organization of similar sequences in other plants, 
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particularly to analyze the location of other cell death 
suppressing and disease resistance sequences. 

The following examples are offered by way of 
illustration and not by way of limitation. 

5 Experimental 
Materials and Methods 

Plant material 

The original llsl mutant, containing the reference 
allele, was obtained from the Maize Genetics Coop., 

10 University of Illinois, Urbana/ Champaign . Stocks containing 
active Mu transposons were obtained from Dr. D. Robertson, 
Iowa State University. The six transposon tagged mutant 
alleles, llsl-1 through llsl-6, were previously designated 
as Us*-29215, 2Is*-42230, I2s*-1127, 22s*-1424, 12s*-3744, 

15 and 22s*-4911, respectively (Johal et al . , (1994), A Tale of 
Two Mimics; Transposon Mutagenesis and characterization of 
Two Disease Lesion Mimic Mutations of Maize, Maydica 39:69- 
76) . 

DNA extraction, RFLP mapping and co- segregation analysis 

20 DNA was isolated by a urea (Dellaporta et al. (1983), 

Plant Molecular Biology Reporter 1:19-22) or CTAB (Hulbert 
et al. (1991) Molecular and General Genetics 225:377-382) 
extraction protocol. DNA samples (15 to 30) from either 
mutant or wild-type plants were pooled and digested 

25 individually with seven restriction enzymes. Southern blot 
analysis was performed as described by (Gardiner et al. 
(19 93) Genetics 134:917-93 0) except that UV crosslinking and 
use of dextran sulfate were omitted. Blots were hybridized 
systematically with specific probes from different Mu 

30 elements. Mapping probes were provided either by the Maize 
Mapping Project* at the University of Missouri or from 
Pioneer Hi-Bred Int. Inc. Pre -hybridizations and 
hybridization of southern blots was performed at 65 °C unless 
otherwise specified. A 3 . 0 kb EcorRI MuQco- segregating DNA 
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marker was cloned from an llsl*-5/llsl-re£ plant using 
standard cloning procedures (Ausubel et al . (1994) Current 
Protocols in Molecular Biology) . The Zap Express™ vector 
(Stratagene) was employed and packaging, screening and in 
vivo excision protocols performed according to manufacturers 
instructions. The primer sequences (SEQ ID NOS 62-64, 
respectively) for confirmation analysis were: GSP1: 5' TGG 
GGA ACT TGA TCG CGC ACG CCT TCG G3 ' , GSP2 : 5' TCG GGC ATG 
GCC TGG GGG ATC TTG G 3 ' , and GSP3 : 5 ' GGC CAC GCG TCG ACT 
AGT AC 3' ( IDT, Coralville IA) . The thermocy cling regime 
used for confirmation analysis was 94 a C for 5 min, then 
cycled 4 0 or 42 times for 3 0 seconds at 94 °C, 1 min and 3 0 
sec at 62°C, and I min at 72*C, and finally 5 min at 72°C. 
Mini -libraries of cloned amplified fragments using the TA 
Cloning4S vector (Invitrogen) were created and individual 
colonies for clones with inserts of the appropriate size. A 
5' RACE fragment was used to screen a pa405 maize seedling 
leaf cDNA library and 3 individual clones were recovered and 
converted to the phagemid form by in vivo excision from the 
Zap Express™ (Stratagene) vector. Primers GSP1 and GSP2 
were used for 5' RACE and GSP3 was used for 3' RACE using 5' 
and 3' RACE Kits and recommended manufacturers instructions 
(GIBCO, MD) . To isolate an llsl genomic clone, a B73 
partial SauIIIA library in lambda Dashll was screened using 
a probe from a 3' RACE product spanning the llsl gene from 
GSP3 to the polyadenylation site. A single positive clone 
was recovered and a 7.129 kb Sad fragment was subcloned 
into pBSKS+ (Stratagene) to create the plasmid pJG201. RFLP 
mapping of the Arabidopsis llsl homolog was performed using 
the Recombinant Inbred (RI) lines generated from a cross 
between Arabidopsis ecotypes Columbia and Landsberg erecta. 
48 RI lines were scored using an EcoRV polymorphism using an 
llsl homolog cDNA as probe. The map position was determined 
on MAPMAKER using the Kosambi mapping function (Lander et 
al. (1987) Genetics 121:174-181). 

Primer extension analysis 
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For primer extension analysis of the maize llsl gene an 
oligonucleotide complementary to the coding strand in the 
llsl gene from 13 9-173 bases downstream of the predicted 
first in-frame ATG was synthesized by DNA Technologies, Inc. 
5 (Coralville, IA) . The oligonucleotide (SEQ ID NO: 65) GSP17 
( 5 ' GTG CTC GGC TCC GCC TGC TCC GCC GCT TCC CCT GG 3 ' ) was 
end- labeled with 32 P. Primer extension analysis was 
performed by the method described by McKnight et al . (1981), 
Analysis of Transcriptional Regulatory Signals of the HSV 

10 Thymidine Kinase Gene: Identificatin of an Upstream Control 
Region, Cell 25:385-3 98, except for the following 
modifications. 40 mg of total RNA from immature tassels of 
a B73 inbred plant and 0.2 pmol of labeled oligonucleotide 
were annealed at one of either 33°C, 37°C, 45°C, or 55°C for 

15 4 hours. Following the extension reaction RNA in the sample 
was removed by adding 2ul of 0 . 5M EDTA and lul of mixed 
RNAases (0.5 mg/ml RNAase A and 10,000 units/ml RNase Tl; 
Ambion) and incubating at 3 7 °C for 3 0 minutes. The primer 
extension products were separated on a 6% denaturing 

20 polyacrylarnide sequencing gel and the size of the extension 
product determined by comparison with a DNA sequence ladder. 

Northern blot analysis 

Total RNA was isolated from leaves of 10 leaf -stage 
wild-type plants in a population segregating for the LeslOl 

25 mutation, Johal and Briggs (1992) Science 255:985-987. mRNA 
was enriched from total RNA using a magnetic bead affinity 
protocol (Dynal Inc. Great Nect NY) . mRNA was isolated from 
A632 inbred plants using the MicroQuick protocol (Pharmacia, 
Piscataway NJ) . Hybridizations were performed either in 

30 modified Church and Gilberts solution at 42 "C or in the 
following hybridization solution at 65° C - 1% casein 
(Technical Grade, Sigma), 1% calf skin gelatin (225 bloom, 
Sigma), 0.2% SDS (Mol. Biol Grade, Fisher), 0.1% Sarkosyl 
(IBI) , 5XSSC. Transfer to nylon membrane (Magnacharge MSI, 

3 5 Westboro MA) was performed by standard protocols, 
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hybridizations were carried out overnight and blots were 
washed as indicated in the results section. 

DNA sequencing and analysis 

DNA sequencing was performed by a cycle sequence method 
5 using a SequiTherm™ Cycle Sequencing Kit (Epicentre, Madison 
WI . ) according to the manufacturers protocol . Local 
sequence comparisons were performed using software including 
ALIGN and MEGALIGN programs of the DNASTAR software package 
(DNASTAR Inc. Madison WI) . Algorithms such as the 

10 neighborhood search algorithm BLAST (Autschul et al. (1990), 
Basic Local Alignment Search Tool, J. Mol. Biol. 215:403- 
410) or BEAUTY (Worley et al . (1995), An Enhanced BLAST - 
based Search Tool that Integrates Multiple Biological 
Information Resources into Sequence Similarity Search 

15 Results, Genome Res . 5:173-184) were employed. Searches of 
the Genbank databases were performed using the National 
Center for Biotechnology Information's BLAST WWW Server with 
links to Entrez and to the Sequence Retrieval System (SRS) 
provided by the Human Genome Center, Baylor College of 

20 Medicine Server at Houston Texas via Internet access. 
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Analysis of light requirement for llsl and dd lesion 
development 

To determine the spectral range of light required for 
lesion formation, sections of leaves were clamped between 
5 0.125 inch Plexiglas GM filters held in place by a metal 
stand with a side arm clamp. The following transparent 
filters were used: Plexiglas GM 2423 (red) , 2711 (Far red) , 
2424 (blue), 2092 (green), 2208 (yellow), and 2422 (Amber) 
or Clear, (Cope Plastics Inc. St. Louis. MO). Transmission 

10 spectra of filters were determined by examining small 

sections of filters in a spectrophotometer. Leaf sections 
of greenhouse or field-grown plants were covered in aluminum 
foil to completely remove incident light. Following 
complete lesioning of a leaf, filters were removed to 

15 observe if lesioning had occurred in the covered region. 

The llsl mutation is cell autonomous and llsl lesions 

exhibit altered phenolic metabolism and callose formation . 

The expression of the llsl phenotype is developmentally 
programmed: a number of round to elliptical lesions often 

2 0 with concentric rings of dead and dying tissue, begin as 

small chlorotic flecks near the tip of the first leaf at the 
three to four leaf stage. While these lesions continue to 
enlarge and eventually coalesce, new lesions initiate down 
the leafblade along an age gradient and cover the whole leaf 
25 within three to four days. Meanwhile, lesions have already 
started near the tip of the second leaf. This pattern 
continues and the plant dies shortly after pollen shed. 
Although the entire leaf tissue becomes necrotic on llsl 
plants, lesions rarely develop on stalks. The llsl mutation 

3 0 is cell autonomous (i.e., the effect of the gene is confined 

to the cell in which it is expressed) as exhibited by both 
revertant sectors (Johal et al. (1994) Maydica, 69-76) and 
forward sectors in that the mutant phenotype does not 
progress into surrounding wild-type tissue. Llsl lesions 
3 5 were examined for callose deposition which is frequently 

associated with response to pathogen infection, wounding or 
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intercellular viral movement (Hammond- Kosack et al. (1996), 
Resistance Gene -dependent Plant Defense Responses, Plant 
Cell 8:1773-1791) . Heavy callousing of all cell types 
within lesions was observed. At the edge of lesions where 
5 cells had not yet collapsed, individual bundle sheath cells 
were the first cells to exhibit callousing of the 
plasmodesmatal fields suggesting that the cells were 
responding to some factor or signal emanating from the dying 
or dead cells. 

10 Mapping of the llsl locus . 

The original llsl allele isolated by Ullstrup and 
Troyer (Ullstrup et al. (1967) Phytopathology 57:1282-1283) 
was used as the reference allele (llsl-ref) . Using a 
combination of cytogenetic and genetic methods, the llsl 
15 gene was initially mapped to the short arm of chromosome 1 
(IS) (Hoisington, (1984) Maize Genetics Newsletter 58:82- 
84) . To map the gene at a higher resolution, a new 
population, in which the progeny segregated 1:1 for 
homozygous and heterozygous llsl plants, was generated. A 
20 W23 inbred plant was fertilized with the llsl pollen derived 
from an llsl-ref /llsl-ref plant, and the resulting progeny 
(two plants) were backcrossed with the llsl-ref homozygotes. 
DNA isolated from 16 mutant and 14 wild-type plants was used 
to examine the linkage with a number of RFLP markers. Three 
25 tightly linked RFLP markers were identified which flank the 
llsl locus. The RFLP marker Php200603 is about 5cm distal 
to llsl, whereas UMC157 is about 8cm proximal to llsl. The 
linkage of llsl with another marker, Php200689, could not be 
broken with these 30 DNAs. All three of these RFLP markers 
30 were invaluable in unequivocally classifying the mutant 
alleles for co-segregation analyses. 

Cloning of the llsl locus bv transposon tagging. 

Due to the lack of biochemical information on the llsl 
mutation, a transposon tagging method was employed to clone 
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the llsl gene. This experimental approach allows genes to 
be cloned solely on the basis of phenotype (Bennetzen et al. 
(1987) , Proceedings of the UCLA Symposium: Plant Gene 
Systems and their Biology, ed, 183-204) . Both targeted and 
5 non-targeted approaches were employed as outlined by (Johal 
et al. (1994) Maydica t 69-76). For the targeted approach, 
llsl -ref /llsl -ref plants were used as male parents and 
crossed with wild-type plants (Llsl/Llsl) from lines active 
for Mu transposition. All Fl plants were expected to be of 
10 wild-type phenotype (Llsl/ llsl-ref) unless a Mu insertion or 
some other mechanism had inactivated the Llsl allele. Such 
an event would result in an llsl*/ llsl-ref plant (llsl* 
refers to a mutant allele generated during transposon 
tagging) with a mutant phenotype. Three plants from 
15 approximately 30,000 Fl progeny exhibited the mutant 

phenotype and one of these died before shedding any pollen. 
The remaining two plants were crossed as male parents to B73 
and Prl inbreds and these two new mutants have been 
designated llsl*-l and llsl*-2 (lls*-29215 and lls*-42230, 
20 respectively, in (Johal et al. (1994) Maydica, 69-76). 

A few of the progeny (10 plants) from the outcross of 
the mutant plants with both inbreds were RFLP genotyped to 
identify plants which had inherited the mutant allele 
(llsl*). Two plants containing the mutant allele were 
25 self-fertilized, and the F2 progeny so derived were found to 
segregate for the llsl phenotype in a 1:3 ratio as expected 
for a recessive mutation. Two other mutant 
allele -containing plants from the outcross progeny were 
backcrossed with the llsl-ref /llslret mutants. The 
30 resultant progeny segregated 1:1 for mutant (llsl*-l or 
- 2111s! -ref ) versus normal plants (Llsl-B73 or 
-Prl /llsl-ref) and were used for co-segregation analysis. 

For non-targeted mutagenesis, Mu-active stocks were 
crossed to an inbred line and the resulting progeny was 
35 self -pollinated to generate F2 (M2) Mutator populations. 
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With this approach, any recessive mutation generated during 
the Fl cross can be detected in the F2 generation. From 
more than 24,000 Mutator F2 families screened, four 
independent families were identified in which one- fourth of 
5 the plants exhibited a phenotype typical of llsl. The four 
mutant alleles have been designated llsl*-3, llsl *4, llsl 
*-5 and llsl *-6. A number of wild-type plants from each of 
these four families were pollinated with the llsl- 
ref/llsl-ref pollen to determine allelism between these new 

10 llsl-like mutants and the original llsl mutant. The 

segregation of llsl mutants in the progeny of most of these 
crosses confirmed allelism between llsl and the new mutants. 
All of these mutants were outcrossed with B73 twice and 
backcrossed to the llslref/llsl-ref mutant to create 

15 populations suitable for co- segregation analysis as 
described above for the targeted mutants. 

The next step was to confirm that the Mu elements 
(there are at least nine of them for Mutator,) had caused 
these new insert ional mutations. This step, called 

20 co- segregation analysis, involved Southern blot analysis to 
detect the linkage of a Mu element with the mutant allele in 
question (Bennetzen et al . (1993) Specificity and Regulation 
of the Mutator Transposable Element System in Maize, Crit . 
Rev. Plant Sci. 12:57-95) . DNA was isolated from 

25 phenotypically mutant and wild-type plants from the 

segregating populations described above for each of the 
mutant alleles. Following identification of a putative 
co- segregating element, the analysis was extended employing 
multiple individual DNA samples digested with an appropriate 

3 0 restriction enzyme. In this manner a 3kb EcoRI restriction 
fragment, hybridizing with the Mu8 specific probe was found 
to co-segregate with 66 DNA samples from the 21sl*-5 
mutation. This co- segregating fragment was cloned and 
sequenced revealing the organization indicated in Figure 1. 

35 The DNA sequence of the right (267bp) flank exhibited 



25 



WO 98/39422 



PCT/US98/04040 



significant homology with an Arabidopsis EST of unknown 
function suggesting that an actual gene was disrupted by the 
Mu8 insertion. On sequencing the 1344 bp left flanking DNA 
no significant homology to known DNA sequences was detected 
5 and a Mu TIR DNA junction (terminal inverted repeats at each 
end of Mu elements) was not observed. Using a Mu TIR primer 
and either an M13 forward or reverse universal primer the 
left flanking (1344bp) or right flanking (267 bp) DNA was 
amplified by PGR and used to probe mutant and wild- type DNA 

10 samples of all mutant alleles. This experiment revealed 
single band polymorphisms in nearly all alleles suggesting 
that this locus was disrupted in several other alleles. 

The occurrence of insertions, in the same locus for 
multiple alleles of the same mutation is considered proof 

15 that the correct locus has been tagged. A PCR based 

approach was used to identify Mu type insertions in the 
vicinity of the cloned region. The right flanking DNA from 
the llsl*-5 clone was sequenced as described above and 
primers designed for extension in each direction. These 

20 primers were used in combination with Mu TIR primers to 
detect amplification products in other mutant allele DNA 
samples but that were absent in their corresponding 
wild-type samples. Two such PCR polymorphisms were 
identified from the targeted allele llsl*-2 and the 

25 non-targeted allele llsl*A. These products hybridized 

strongly on a southern blot with the right flanking DNA from 
allele llsl*-5 indicating that these amplification products 
were amplified from the same locus. In addition, the 
amplification of a smaller (189bp) gene specific fragment 

3 0 was observed in all the mutant and wild- type DNA samples 

from all alleles that hybridized with the right flanking DNA 
of the original llsl*-5 clone. Since all these samples were 
heterozygous for the llsl-ref allele this result indicated 
that the llsl-ref mutation had also resulted from a Mu 

3 5 insertion. Nested PCR using a Mu TIR primer and GSP2 was 
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performed to isolate this fragment. All PCR products were 
directly sequenced using the GSP1 or GSP2 primers as 
sequencing primer and allowed identification of Mu-type 
insertions within 246 bp and 292 bp 5' of the insertion site 
5 of allele llsl*-5 in allele llsl*-2 and llsl*-4 

respectively. It was determined that the llsl-ref allele 
had a Mu insertion at the same site of insertion as that of 
allele llsl*-5. Southern analysis using the lef t-f lanking 
DNA of the llsl* -5 clone revealed that the insertion of a Mu 

10 element in the llsl-ref allele was not accompanied by a 

duplication event showing that the two alleles arose due to 
independent transposition events (explained below) . 

The occurrence of four independent Mutator insertions 
in the same locus in plants with the llsl phenotype but not 

15 their corresponding wild-type siblings constitutes proof 

that a fragment of the llsl locus had been isolated. It was 
observed that a Mu insertion event gave rise to the llsl-ref 
allele which was believed to arise in a non-Mu active 
background, suggesting that cosegregation analysis should be 

20 attempted with an allele of unknown origin before employing 
it in a targeted mutagenesis strategy since the occurrence 
of an insertion in the same region of the gene could 
obfuscate co- segregation analysis with a new allele. 

The llsl locus encodes a novel plant protein 

25 To characterize the llsl locus fully a cDNA and genomic 

clone was isolated. Gene specific primers GSP1 and GSP3 
were employed along with universal primers to amplify 5' and 
3' fragments respectively of the llsl transcript from a cDNA 
library constructed from 2 week old inbred PA405 seedlings. 

30 A 5 ' fragment was then used as a probe to screen the PA405 
cDNA library and 3 individual clones were recovered and the 
longest phagemid named pJG200 was sequenced (Genbank Acc. # 
U77345) . This sequence was used to screen a maize EST 
database and another llsl cDNA with an additional 180 bp at 
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the 5 ' end was recovered. The combined sequence of these 
two cDNAs is shown in SEQ ID NO: 1 and a 521 amino acid 
continuous open reading frame can be predicted from this 
partial transcript (SEQ ID NOS 1 & 2, respectively). The 
5 identification of the termination codon was supported by a 
similarly located predicted termination codon in the 
sequence of an Arabidopsis llsl homolog (below) . A primer 
designed against 13 9bp to 173bp downstream of the predicted 
start codon of this sequence (GSP 17) was used for primer 
10 extension analysis and a 454 bp long primer extension 

product was observed thus predicting a 2119 bp total length 
transcript for the llsl gene. In addition, the 3' ends of 
the cDNAs and the 3' ends of the three PCR-amplif ied 3 '-ends 
were also sequenced and three different polyadenylation 
15 sites determined thus allowing for variation in the size of 
the full length transcript (SEQ ID NO: 1 and Figure 1) . 

A 3' fragment of the llsl gene was utilized to screen a 
partial Sau3A genomic library of the maize inbred line B73 
in order to isolate a full-length llsl gene sequence and a 
20 single positive clone (designated G18) was isolated. A 7129 
bp SacI fragment was subcloned from the G18 genomic clone 
and the resulting plasmid named pJG201 was entirely 
sequenced (Genbank Acc# U77346) . By comparison with the 
cDNA sequence pJG201 was found to contain almost the entire 
25 llsl coding region and a 5' region likely to include the 

entire promoter. The predicted genomic organization of the 
llsl gene (Figure 1) includes 7 exons and 6 introns . The 
Sad restriction site at bp 7129 is 45 bp upstream of the 
predicted stop codon and 320bp upstream of the 
30 polyadenylation sites. Providing that there are no other 
introns in the 5' region of the gene the predicted 
transcriptional start site of the llsl gene occurs at bp 
3115 of the 7129 bp subclone. 

Southern hybridization suggests that the llsl gene is 
35 single copy in the genome of maize since only one band was 
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observed on Southern blots of the wild-type DNA samples of 
the llsl -ref allele cut with several restriction enzymes. 
That a duplicate of the llsl gene exists has not yet been 
determined using lower stringency washes . Three bands were 
observed in llsl* -5 when the EcoRl digested mutant samples 
were probed with the left flank. A 10 bp direct repeat was 
not observed on each side of the Mu8 insertion in allele 
llsl*5. These results suggested that a rearrangement of DNA 
more complex than a simple Mu8 element insertion had 
occurred at this locus and the nature of this rearrangement 
was determined by comparison with the genomic sequence of 
the llsl gene. The left flanking DNA comprises a direct 
repeat of the llsl genomic sequence extending from the EcoRl 
site of Intron II to bp 43 of exon 4. 

The predicted llsl protein exhibits a largely 
hydrophilic protein with a pi of 7.5. No hydrophobic 
regions suggesting membrane association were observed. This 
fact suggests a cytosolic or plastidic subcellular location 
for the LLS 1 protein. 

20 The llsl aene exhibits tissue and cell specific expression 

The llsl phenotype is developmentally expressed as 
described above. LLS1 appears to-be needed in expanded 
leaves but not in very young leaves and thus llsl 
transcripts may accumulate in older leaves if the gene is 

25 transcriptionally regulated. The expression of llsl in 

fully expanded leaves of normal plants was examined using a 
partial cDNA probe that extends from the beginning of exon 2 
to the end of the llsl transcript. A weak signal was 
detected using 20pg of total RNA and a high stringency wash. 

3 0 There did not appear to be a significant gradient in gene 
expression from three successively older leaves. When mRNA 
derived from pooled total RNA from these leaves was utilized 
a single transcript was readily detected (Figure 2). The 
size of this single transcript was estimated at 1.9 + 0.2 kb 

29 
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a figure which coincides closely with the full-length size 
determined by primer extension analysis (1.12 9 kb) . To 
further examine the developmental pattern of llsl gene 
expression, mRNA derived from various plant tissues was 
probed with an 802bp Notl/PstI fragment that extends from 
the end of exon 2 to exon 7 (Figure 1) . Lowest levels of 
expression were seen in seedling leaves, 3 week old embryos 
and in young tassels. The llsl transcript was readily 
detected in more mature tassels, young and old ear shoots 
and 1 week old embryoe. Surprisingly, the llsl transcript 
was most readily detected in seedling roots where the llsl 
phenotype has not been observed. In addition, the presence 
of a second larger transcript (approximately 2.4 kb) was 
observed that hybridizes with the llsl probe in seedling 
roots and older tassel tissue. When observed this larger 
transcript seems to be expressed in equivalent amounts to 
the lower transcript. Since genomic blots have indicated 
that llsl is a single copy gene, the larger transcript may 
represent post-transcriptional regulation of llsl although 
there is precedence for a northern blot to reveal the 
existence of a second gene when a southern analysis failed 
to do. so. These results indicate that the llsl gene is not 
expressed constitutively in all tissues but exhibits tissue 
specific transcriptional regulation. 

The llsl gene is conserved between monocot and dicot plants 

To determine if llsl related genes are present in other 
species or organisms the predicted llsl protein sequence was 
utilized to search public databases of sequences of both 
known and unknown functions. As indicated above, 
significant homology (70% nucleic acid identity) was 
observed between the right flanking DNA of llsl* -5 and an 
expressed sequence tag (EST) from Arabidopsis tha.lia.na. 
(Genbank Acc. # T45298) . Three other Arabidopsis ESTs were 
identified that overlap with this EST (Genbank Acc. #s 
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(dd-1) that is an EMS induced lesion-mimic mutation with 
spreading lesions highly reminiscent of llsl lesions. A 
segregating mapping population was created by crossing a 
dd/dd line with Shangai Red DD/DD and the progeny were 
allowed to self. Plants segregating for drop-dead were 
identified and DNA isolated from several mutant and 
wild-type progeny. A polymorphism for the llsl locus could 
not be identified but a polymorphism for the probe PIO200640 
which is ~33cM distal to llsl was identified with HindE. 
This polymorphism showed complete segregation with 14 mutant 
and 16 wild-type progeny strongly suggesting that this 
mutation maps to a region syntenic with llsl and that llsl 
and dd are homologous mutations and possibly orthologs. 

llsl lesions are induced bv wounding and in leslOl/llal 
double mutants 

In addition to intrinsic, developmental signals, 
external factors also affect llsl expression. llsl lesions 
normally appear randomly on developmentally competent areas 
of the leaf. However, llsl lesions can also be triggered to 
initiate at any site (provided that the tissue is 
developmentally competent) by killing cells either by 
inducing an HR with an incompatible pathogen or by physical 
means (making pin prick wounds) . The additive phenotype of 
the double mutant of llsl with Les2 or Les*-101 (two 
dominant mimics that can initiate numerous lesions on maize 
leaves before they become developmentally competent to 
express llsl) further supports these results. On the double 
mutants, the early phenotype of the lesions is discrete and 
of the respective Les type and also of higher density as 
compared to that of llsl lesions. However, as the tissue 
acquires developmental competence to be able to express the 
llsl phenotype, most, if not all, Les sites transform into 
llsl lesions that expand in an uncontrolled fashion to 
consume the whole leaf. Thus the internal metabolic upset 
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and cell death events associated with a Les*-101 lesion 
appear to act as a trigger for llsl lesions. 

Light is required for llsl and dd lesion formation 

These observations fully support the hypothesis that 
5 llsl functions to contain cell death from spreading, and it 
appears to be critical during late stages of plant 
development. Interestingly, the expression of llsl lesions 
is completely dependent on light. The region in the center 
of the leaf was covered with aluminum foil just as lesions 

10 were initiating at the tip of the leaf. The lesions formed 
progressively down the leaf but not where the leaf was 
protected from light. Aluminum foil also protected lesions 
induced by pin-prick wounding in maize llsl plants and also 
observed clearly in sorghum drop-dead plants . Using plastic 

15 filters that transmit different wavelengths of light, it was 
found that visible light in the spectral region of 650-700 
nm is sufficient for this effect. Yellow and orange filters 
also transmitted some red light in the 650-700 nm so a 
contribution from light in the 560 to 64 0 nm range cannot be 

20 excluded. Lesions did not form when only blue, green, or 
far-red light reached the leaf. This phenomenon suggested 
that active photosynthesis, which harvests light 
pre- dominant ly in the red spectral region, is required for 
lesion formation. This was addressed genetically by 

25 creating double mutants of llsl with iojap 1 (ijl- a 

recessive mutation in maize that produces albino and light 
green sectors on an otherwise normal green leaf) or ncs7 
which also exhibits light green but not albino sectors. 
These double mutants have revealed that llsl lesions can 

3 0 only form in dark green tissues. This result indicates that 
some activity related to light harvest or photosynthesis may 
be important in the initiation and spread of lesions. 
Double mutants of llsl with oil yellow-700 provide further 
support to this interpretation. Oyl - is a dominant mutation 



33 



WO 98/39422 



PCT/US98/04040 



Bacterid . 178:3133-3139) , which are present in the a- 
subunit of all aromatic ring-hydroxylating (ARH) 
dioxygenases involved in the degradation of phenolic 
hydrocarbons. In addition, the spacing (-90 amino acids) 

5 between these motifs, which has recently been shown to be 
conserved in all ARH dioxygenases, is precisely maintained 
in LLS1, adding further evidence that LLS1 may encode a 
dioxygenase function. The ARH dioxygenases consist of 2 or 
3 soluble proteins that interact to form an electron 

10 transport chain that transfers electrons from NADH via 

flavin and iron-sulfur (2Fe-2S) redox centers to a terminal 
dioxygenase. The latter, which is also a multimeric enzyme 
consisting of either a homomers or a and j3 heteromers, 
catalyzes the incorporation of two hydroxyl groups on the 

15 aromatic ring at the expense of dioxygen and NAD ( P ) H . 

The consensus sequence of both the Rieske- and iron- 
binding motifs (SEQ ID NOS 6-7, respectively) as well as the 
spacing between them are precisely conserved in a 
hypothetical protein (translated from an ORF) from 

20 Synechocystis sp. PCC6803, which in addition, exhibits 66% 
amino acid identity to LLS1 among a stretch of more than 100 
amino acids. Additionally, the Rieske center-binding site 
has also been detected in the partial sequence of two 
seemingly related ESTs (SEQ ID NOS 31-32, respectively) of 

25 unknown function, one each from rice and Arabidopsis. 

llel and Cochliobolus carbonum 

Inoculation of llsl leaves with Cochliobolus carbonum 
Race" 1 causes a proliferation of llsl -type necrotic lesions 
in the middle to upper parts of the leaves. These UsI-type 
3 0 lesions superficially resemble C. carJbonuin lesions but they 
are sterile. That is, plating explants on carrot agar 
medium does not usually yield any C. carbonum fungal growth. 
Spontaneous llsl lesions occurring without inoculation are . 
also sterile and appear similar. Thus the lesions induced 
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by C. carbonum inoculation are apparently Usl-type lesions 
and not susceptible C. carbonum lesions. This raises the 
question as to whether these lesions indicate that the llsl 
mutant is susceptible to C. carbonum or not. It seems 
5 likely that the llsl plants are resistant to C. carbonum, 
but that C. carbonum is able to trigger llsl lesion 
formation. The C. carbonum could be acting as a stress that 
sets off the llsl lesion development. After all, even 
abiotic stresses, such as needle pricking, will also induce 

10 llsl lesion formation. 

Inoculation of llsl leaves with Cochiobolus carbonum 
toxin plus or toxin minus causes few if any lesions to form 
in the middle to lower parts of the inoculated leaves. This 
observation is interpreted to mean that the llsl mutation 

15 possesses induced resistance to C. carbonum in that area of 
the leaf. While both spontaneous llsl lesions and C. 
carbonum lesions physically resemble each other, neither 
type was seen in this area of the leaf . In the middle 
transitional area there are some nascent smaller llsl 

20 lesions. It appears as though only the upper acropetal 

areas of the leaf at this stage of development, are capable 
of forming spontaneous llsl lesions or C. carbonum induced 
lesions . 

In the lower-middle areas of llsl leaves without any 
25 pathogen inoculation, a several fold elevation of PR1 and 
chitinase proteins was observed on western blots over that 
of Llsl/llsl wildtype heterozygotes . Upon inoculation, the 
PR1 and chitinase expression in this area of the leaves was 
elevated slightly in llsl and substantially in the Llsl/llsl 
3 0 heterozygotes, such that after inoculation both llsl and the 
wildtype heterozygotes have similar levels of PR1 and 
chitinase. Thus it appears that: 1) elevated PR gene 
expression is correlated with resistance to C. carbonum in 
the lower middle area of the leaves, and 2) the PR gene 
3 5 induction exists prior to the resistance. 
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llsl and Cochiobolus heterostrophus 

As was seen with C. carbonvm, inoculation of llsl 
leaves with Cochiobolus heterostrophus also causes a 
proliferation of llsl-type necrotic lesions in the middle to 
5 upper parts of the leaves. These llsl-type lesions are 
generally distinguishable from C. heterostrophus necrotic 
lesions. These lls-type lesions are also sterile; that is, 
plating explants on carrot agar medium does not usually 
yield any C. heterostrophus fungal growth. Spontaneous llsl 

10 lesions occurring without inoculation are also sterile and 
appear similar. Thus the lesions induced by C. 
heterostrophus inoculation are apparently llsl-type lesions 
and not susceptible C. heterostrophus lesions. It appears 
that C. heterostrophus triggers formation of llsl lesions. 

15 C. heterostrophus appears to be acting as a stress that sets 
off the llsl lesion development. After all, even abiotic 
stresses, such as needle pricking, will also induced llsl 
lesion formation. 

Inoculation of llsl leaves with Cochliobolus 

20 heterostrophus causes few if any lesions to form in the 
middle to lower parts of the inoculated leaves. This 
observation was interpreted to mean that the llsl mutation 
possesses induced resistance to C. heterostrophus in that 
area of the leaf. Spontaneous llsl lesions and C. 

25 heterostrophus lesions are usually distinguishable by 

appearance, yet neither type was observed in this area of 
the leaf. In the middle transitional area there are some 
nascent smaller llsl lesions, so it appears as though only 
the upper acropetal areas of the leaf are capable of forming 

30 llsl lesions. However, the lack of C. heterostrophus 
lesions, in this area of the leaf relative to their 
appearance in Llsl/llsl and Llsl/Llsl wildtype controls, 
indicates that llsl possesses resistance to C. 
heterostrophus in that area of the leaf. That the llsl 

35 heterozygotes are not resistant indicates that this 
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resistance, like llsl lesion formation, is a recessive 
Mendel ian trait. 

In the lower-middle areas of llsl leaves without any C. 
heterostrophus inoculation, a several fold elevation of PR1 
5 and chitinase proteins was observed on western blots over 
that of Llsl/llsl wildtype heterozygotes. Upon inoculation 
with C. heterostrophus, the PR1 and chitinase in this area 
of the leaves is elevated slightly in 22sl and substantially 
in the Llsl/llsl heterozygotes, such that after inoculation 
10 they have similar levels of PR1 and chitinase. Thus it 
appears that elevated PR gene expression is correlated to 
resistance to C. heterostrophus in the lower middle area of 
the leaves, and that this elevated PR gene expression occurs 
prior to the inoculation and resistance. 

15 llsl and Puccinia sorahi (Rust) 

Rust inoculation of llsl plants does not necessarily 
induce Ilsl-type necrotic lesions. It was observed that 
rust will infect llsl plants and produce sporulating 
lesions. This indicates that unlike C. carbonum, C. 

2 0 heterostrophus, and Puccinia sorghi, rust, a biotrophic 

pathogen, is able to infect llsl and Lls/llsl heterozygote 
control plants, The fact that P. sorghi will infect and form 
lesions indicates that P. sorghi can evade triggering llsl 
lesions formation and that it can survive and grow on llsl. 
25 The llsl mutation is therefore not necessarily rust 
resistant. Differences that may exist in rust 
susceptibility in the acropetal versus basipetal regions of 
the leaf have not been investigated. 

Western blots revealed that mutant llsl plants and 

3 0 Llsl/llsl wildtype heterozygote plants had similar levels of 

chitinase expression following rust inoculation. The 
expression of PR1, however, was slightly higher in the 
wildtype plants than in llsl mutants following rust 
inoculation. These experiments seem to indicate that 
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although rust is able to avoid triggering llsl-type lesions 
formation in llsl, it still manages to trigger at least 
chitinase expression. These results may have important 
ramifications for understanding how pathogens are detected 
by the plant host, and if detected, whether by the same or 
different mechanisms, how the signaling pathways determine 
whether PR gene expression activated. 

To date no studies have isolated a protein (s) or 
gene(s) ubiquitously involved in the degradation of plant 
phenolics. Phenolics in plants are often sequestered in 
cell compartments until needed or synthesized only when 
required. Some phenolics however such as benzoic acid and 
salicylic acid have been proposed to play key roles in 
preconditioning cells to undergo cell death during the 
hypersensitive response as described by current models for 
systemic acquired resistance in dicot plants. 

One candidate that may fit well in this role is 
salicylic acid (SA) . SA, which exhibits a 10-50 fold 
increase during the HR and is also triggered in response to 
oxidative stresses associated with ozone or UV exposure 
{ Hammond -Kosack and Jones (1996) Resistance Gene -dependent 
Plant Defense Responses, Plant Cell 8:1773-1791); Ryals et 
al. (1996) Systemic Acquired Resistance, Plant Cell 8:1809- 
1819), is known to cause H 2 0 2 buildup (Chen et al. (1993) 
Involvement of Reactive Oxygen Species in the Induction of 
Systemic Acquired Resistance by Salicyclic Acid in Plants, 
Science 242:883-886) and transmute into a cell damaging free 
radical under oxidinzing conditions (Durner and Klessig 
(1996) Salicylic Acid is a Modulator of Tobacco and 
Mammalian Catalases, J. Biol. Chem. , 271:28492-28501) . 
These attributes of SA indicate that it may be a mediator of 
cell death in llsl mutants, a hypothesis fuly compatible 
with the demonstrated dependence on SA of cell death 
associated with a number of Arabidopsis lsd mutants (Dangl 
et al. (1996) Death Don't Have no Mercy: Cell Death 
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Programs in Plant-microbe Interactions, Plant Cal 1 5; 1793- 
1807; Weyman et al. (1996) Suppression and Restoration of 
Lesion Formation in Arabidopsis Isd mutants, Plant Ce] 1 
12:2013-2022) . However, as noted above, the possibility 
5 nevertheless remains that a novel compound or mechanism is 
responsible for llsl-associated cell death. 

The predicted association of LLS1 with an iron- sulfur 
cluster suggests that it may also participate in oxidation- 
reduction reactions. Proteins that use iron-sulfur clusters 

10 as prosthetic groups often function as biosensors of 

oxidants and iron (Roualt and Klausner (1996) Iron-sulfur 
Clusters as Biosensors of Oxidants and Iron, Trends Biochem. 
Sci . 21:174-177) . LLS1 may also serve as a kind of rheostat 
such as that proposed for LSD1 in regulating cell death in 

15 plants (Jabs et al. (1996) Jnitiation of Runaway Cell Death 
in an Arabidopsis Mutant by Extracellular Superoxide, 
Science 273:1853-1856) . 

Working model for llsl function 

As noted earlier, the present invention is not 
20 dependent upon a particular mode of action. However, it is 
predicted that the LLS1 protein functions to inhibit the 
action of a cell "suicide factor" by degrading that factor. 
The suicide factor is a phenolic compound that is either a 
toxin or signal associated with photosynthetic stress or 
25 wounding or due to metabolic upset in the case of 

llsl/LeslOl double mutants. Phenolics can cause superoxide 
production formation by donating an electron to dioxygen 
while in a semiquinone form (Appel (1993) Phenolics in 
Ecological Interactions.: The Importance of Oxidation, 
30 Chem. Ecol. 19:1521-1552) . Photosynthetic organisms have 
evolved multiple mechanisms to dissipate excess energy and 
avoid the production of reactive oxygen intermediates (ROD 
during photosynthesis. Free-radicals are scavenged by 
ascorbate, carotenoids, the xanthophyll cycle, 
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alpha -tocopherol, glutathione, and various phenolics 
(Alscher et al . (1993), Antioxidants in Higher Plants). The 
oxidative state of a cell influences dramatically the 
ability of phenolics to promote free radical formation 
5 (Appel (1993) Phenolics in Ecological Interactions: The 
Importance of Oxidation, J. Chem. Ecol . 19:1521-1552). The 
development of llsl lesions could result in cell death due 
to the inability to remove a toxic phenolic or signal that 
has accumulated in a cell. 
10 Whereas a toxin may directly inhibit basic metabolic 

processes a signal may trigger a programmed cell death 
pathway that is reminiscent of the hypersensitive response. 
Lesions thus spread because the release of the contents of 
dying cells cause oxidative stress in surrounding cells and 
15 result in the autocatalytic production of the cell suicide 
factor. Alternatively a signal for cell death may activate 
cell death programs in surrounding cells unless it is 
removed. The developmental gradient of llsl lesion 
expression may reflect the accumulation of a suicide factor 
2 0 in older cells. Young tissue does not form lesions when 
wounded and this may reflect the lack of accumulation of a 
suicide factor, the inability to yet synthesize that 
compound or the existence of a juvenile llsl homolog. 
Protection of the plant tissue from light would directly 
25 reduce the concentration of the suicide factor and avoid 

lesion formation. The concentric circle appearance of llsl 
lesions may thus result from variation in the production of 
the suicide factor due to diurnal light cycles. Revertant 
sectors would be resistant to this suicide factor and the 
30 ability of lesions to "traverse" pale green or albino 

sectors in llsl/llsl io/io or llsl/llsl NCS7 double mutants 
would reflect the concentration and dif fusibility of the 
toxic phenolics across tissues less able or unable to 
produce the suicide factor. In normal tissues functional 
35 LLS1 limits the effect of a suicide factor released in the 
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process of wounding or stress. Finally it is expected that 
if LLS1 affects phenolic metabolism that a change in 
phenolic profile would occur in llsl plants. Significantly, 
this prediction is supported by the report that a 
5 para-coumaric ester accumulates in llsl lesioned plants but 
not in normal wild-type siblings or wild-type siblings 
inoculated with the fungus Cochliobolus he ter ostrophus 
(Obanni et al . (1994) Phenylpropanoid accumulation and 
Symptom Expression in the Lethal Leaf Spot Mutant of Maize, 
10 Phvsiol. Mol. Plant Path. 44:379-388) . 

llsl may play a role in the Hypersensitive Response 

A complex series of cellular events is envisaged to 
occur during the activation of defense responses in plants 
(Hammond-Kosact et al . (1996) Resistance Gene -dependent 

15 Plant Defense Responses, Plant Cell 8:1773-1791). 

Incompatible responses will often lead to the death of an 
infected cell within a few hours of infection. There is 
considerable evidence that this hypersensitive response (HR) 
is a form of programmed cell death activated by the plant 

20 cell. Lesion mimic mutations may cause an uncoupling of the 
regulatory steps of this process. Recent evidence has shown 
that control of cell death involves checkpoints that 
negatively and positively modulate the decision to progress 
to cell collapse. Evidence is provided by the observation 

25 that the lesion mimic phenotype of the lsdl and lsd6 

mutations of Arabidopsis are suppressed in the presence of 
the transgene nahG which degrades salicylic acid (SA) . 
Application of 2,6 dichlorisonicotinic acid (a chemical 
inducer of systemic acquired resistance - SAR) restored 

30 lesion phenotype of these mutants (Dangl et al . (1996) Plant 
Cell 8:1793-1807). This result directly implicates SA in 
the signalling pathway that leads to cell death in these 
lesion mimics and that normally LSD1 and LSD6 would serve to 
negatively modulate that pathway. acdl plants form 
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spreading lesions in the presence of a functional Isdl gene 
suggesting that ACD1 operates downstream or on a separate 
pathway from LSD1 . Also there is evidence to indicate that 
SA donates an electron to catalase and in so doing becomes a 
free radical which interacts with membrane lipids to promote 
lipid peroxides which further promote membrane damage and 
cell collapse. Collectively these results suggest that acdl 
functions downstream of Isdl to inhibit a cell death pathway 
that is promoted by superoxide via SA and it may be that 
acdl transcription is activated by LSD1. ACD1/LLS1 may 
degrade SA and thus negatively regulate a signalling pathway 
that could lead to runaway promotion of cell death. 
ACD1/LLS1 may be positively regulated by competing sensors 
of well being within the cell via the LSD1 protein and or 
other activators. Thus in an llsl mutant what normally may 
constitute a minimal stress may become exaggerated through a 
runaway amplification loop and cell death pathways may be 
triggered resulting in lesion formation. This model 
predicts that nahG in an acdl/acdl mutant will abolish 
lesion formation. 

Cell death mechanisms in plants versus animals — 

Lesion mimic genes are now providing insight into the 
kinds of genes involved in regulating cell death in plants. 
Three lesion mimic genes have now been cloned and do not 
have related counterparts in animal systems. This suggests 
that cell death is regulated in plants in a manner very 
different from models describing cell death regulation in 
animals although a role for ROI seems common to both 
systems. The recently cloned mlo locus from barley has been 
shown to encode a membrane protein and the Isdl gene from 
Arabidopsis may encode a transcriptional activator. Both of 
these genes may normally serve to interpret external or 
internal stress signals and when mutated turn on or of f 
other genes that cause cell death or cell survival 



43 



WO 98/39422 



PCT/US98/04040 



respectively. The llsl gene appears to be encode an enzyme 
involved in suppressing the spread of cell death through 
some aspect of phenolic metabolism. Phenolic production has 
long been long associated with cell death in plants but 
little understood at the molecular level. Studies of the 
cloned llsl gene may afford unexpected insights into this 
important aspect of plant physiology. 

Expression profile of lethal leaf spot 1 (llsl) 

In leaves 2 and 4 of 16 -days-olds wild- type seedlings 
(Mol7, B73), the strongest expression of llsl is seen in 
both upper and lower epidermis and its derivatives (such as 
silica cells) , in sklerenchyma cells on either side of 
vascular bundles, and in protoxylem elements. A weaker, but 
clearly discernible expression signal is observed in bundle 
sheath, mesophyll cells and midrib parenchyma. Expression 
is undetectable in metaxylem, phloem and companion cells. 

In 7 -day-old darkgrown wild- type seedlings (B73) , llsl 
expression can be detected at low levels in a uniform 
distribution throughout most leaf cells. Slightly elevated 
levels can be found in coleoptile and midrib of the two 
oldest leaves. 

In leaves of the dominant lesion mimic mutant Les 101, 
and in the llsl mutant itself, expression of llsl is 
essentially the same as in wild-type. 

For in situ expression analysis of llsl, a 0.7kb Notl- 
Pstl fragment from the middle of the cDNA was used to make 
labeled sense and antisense riboprobes. 

* Clones comprising the genomic sequence and cDNA 
sequence described herein were deposited on 14 November 1996 
with the American Type Culture Collection, Rockville, 
Maryland, and given accession numbers ATCC 97791 and ATCC 
97792 . 

All publications and patent applications mentioned in 
the specification are indicative of the level of those 



44 



WO 98/39422 PCTYUS98/04040 

skilled in the art to which this invention pertains. All 
publications and patent applications are herein incorporated 
by reference to the same extent as if each individual 
publication or patent application was specifically and 
5 individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in 
some detail by way of illustration and example for purposes 
of clarity of understanding, it will be obvious that certain 
changes and modifications may be practiced within the scope 
10 of the appended claims. 
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(ii) TITLE OF INVENTION : METHODS AND COMPOSITIONS FOR CONTROLLING 
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(iii) NUMBER OF SEQUENCES : 65 

(iv) CORRESPONDENCE ADDRESS : 
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(D) STATE: North Carolina 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1855 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 15.. 1574 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:l: 
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CGTGCGGGGA GAAT ATG CGC GCG ACA ATC CCA GCC CTG TCG CTC CTG GTG 50 
Met Arg Ala Thr lie Pro Ala Leu Ser Leu Leu Val 
15 10 

ACG CCG CGG CTC CCC TCG CTC GCC GTG CCG CTG GCT GGA GGC CGC CTC 98 
Thr Pro Arg Leu Pro Ser Leu Ala Val Pro Leu Ala Gly Gly Arg Leu 
15 20 25 

CGC GAG GGC GGT CGT TCT CGG ACC CGC CTC CGC GTG GCG GCG CCG ACG 146 
Arg Glu Gly Gly Arg Ser Arg Thr Arg Leu Arg Val Ala Ala Pro Thr 
30 35 40 

TCC GTA CCA GGG GAA GCG GCG GAG CAG GCG GAG CCG AGC ACG TCG GCG 194 
Ser Val Pro Gly Glu Ala Ala Glu Gin Ala Glu Pro Ser Thr Ser Ala 
45 50 55 60 

CCC GAG TCC GGC GAG AAG TTC TCG TGG AGG GAT CAC TGG TAC CCG GTC 242 
Pro Glu Ser Gly Glu Lys Phe Ser Trp Arg Asp His Trp Tyr Pro Val 
65 70 75 

TCC CTC GTC GAG GAC CTC GAC CCC AGC CGC CCC ACC CCG TTC CAG CTC 290 
Ser Leu Val Glu Asp Leu Asp Pro Ser Arg Pro Thr Pro Phe Gin Leu 
80 85 90 

CTC AAC CGC GAC CTC GTC ATC TGG AAG GAA CCC AAG TCC GGC GAG TGG 338 
Leu Asn Arg Asp Leu Val lie Trp Lys Glu Pro Lys Ser Gly Glu Trp 
95 100 105 

GTC GCG CTC GAC GAC CGC TGC CCC CAC CGC CTT GCC CCG CTC TCG GAG 3 86 

Val Ala Leu Asp Asp Arg Cys Pro His Arg Leu Ala Pro Leu Ser Glu 
110 115 120 

GGC AGG ATC GAT GAG ACG GGG TGC TTG CAG TGC TCG TAT CAC GGA TGG 434 
Gly Arg lie Asp Glu Thr Gly Cys Leu Gin Cys Ser Tyr His Gly Trp 
125 ~ 130 135 140 

TCA TTC GAT GGC TCC GGC GCC TGC ACC AAG ATC CCC CAG GCC ATG CCC 4 82 

Ser Phe Asp Gly Ser Gly Ala Cys Thr Lys He Pro Gin Ala Met Pro 
145 150 155 

GAG GGT CCT GAG GCC CGT GCG GTG CGG TCA CCG AAG GCG TGC GCG ATC 53 0 

Glu Gly Pro Glu Ala Arg Ala Val Arg Ser Pro Lys Ala Cys Ala He 
160 165 170 

AAG TTC CCC ACC CTC GTC TCC CAG GGG CTG CTC TTC GTG TGG CCC GAT 578 
Lys Phe Pro Thr Leu Val Ser Gin Gly Leu Leu Phe Val Trp Pro Asp 
175 180 185 

GAG AAT GGG TGG GAG AAA GCG GCC GCC ACC AAG CCT CCA ATG TTG CCG 626 
Glu Asn Gly Trp Glu Lys Ala Ala Ala Thr Lys Pro Pro Met Leu Pro 
190 195 200 

AAA GAA TTT GAG GAC CCG GCC TTC TCC ACG GTG ACA ATC CAG AGG GAC 674 
Lys Glu Phe Glu Asp Pro Ala Phe Ser Thr Val Thr He Gin Arg Asp 
205 210 215 220 

TTG TTC TAT GGT TAT GAT ACG TTG ATG GAG AAC GTC TCT GAT CCG TCC 722 
Leu Phe Tyr Gly Tyr Asp Thr Leu Met Glu Asn Val Ser Asp Pro Ser 
225 230 235 
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CAT ATA GAA TTT GCT CAC CAC AAG GTT ACT GGA CGA AGA GAT AGA GCC 770 
His He Glu Phe Ala His His Lys Val Thr Gly Arg Arg Asp Arg Ala 
240 245 250 

AGG CCT TTG ACA TTC AGG ATG GAA TCA AGT GGT GCC TGG GGT TAC TCA 818 
Arg Pro Leu Thr Phe Arg Met Glu Ser Ser Gly Ala Trp Gly Tyr Ser 
255 260 265 

GGA GCA AAT TCT GGT AAT CCT CGC ATT ACT GCA ACT TTT GAG GCC CCT 866 
Gly Ala Asn Ser Gly Asn Pro Arg He Thr Ala Thr Phe Glu Ala Pro 
270 275 280 

TGT TAT GCA TTG AAC AAA ATA GAG ATA GAC ACA AAG TTA CCC ATT TTT 914 
Cys Tyr Ala Leu Asn Lys He Glu He Asp Thr Lys Leu Pro He Phe 
285 290 295 300 

GGC GAC CAG AAA TGG GTC ATA TGG ATT TGC TCT TTC AAC ATT CCA ATG 962 
Gly Asp Gin Lys Trp Val He Trp He Cys Ser Phe Asn He Pro Met 
305 310 315 

GCC CCA GGG AAG ACT CGT TCT ATT GTC TGT AGC GCT CGA AAC TTT TTC 1010 
Ala Pro Gly Lys Thr Arg Ser He Val Cys Ser Ala Arg Asn Phe Phe 
320 325 330 

CAG TTC ACA ATG CCA GGA AAA GCA TGG TGG CAG CTT GTT CCT CGA TGG 1058 
Gin Phe Thr Met Pro Gly Lys Ala Trp Trp Gin Leu Val Pro Arg Trp 
335 340 345 

TAT GAA CAT TGG ACT TCA AAT TTG GTC TAT GAT GGC GAT ATG ATC GTT 1106 
Tyr Glu His Trp Thr Ser Asn Leu Val Tyr Asp Gly Asp Met He Val 
350 355 360 

CTT CAA GGC CAG GAG AAG ATT TTC CTA GCT GCA ACC AAG GAG TCT TCT 1154 
Leu Gin Gly Gin Glu Lys He Phe Leu Ala Ala Thr Lys Glu Ser Ser 
365 370 375 380 

ACG GAT ATT AAT CAG CAG TAC ACA AAG ATC ACA TTC ACG CCC ACA CAA 1202 
Thr Asp He Asn Gin Gin Tyr Thr Lys He Thr Phe Thr Pro Thr Gin 
385 390 395 

GCT GAT CGA TTT GTT TTA GCA TTC CGC ACA TGG CTA AGG AAA TTT GGC 1250 
Ala Asp Arg Phe Val Leu Ala Phe Arg Thr Trp Leu Arg Lys Phe Gly 
400 405 410 

AAT AGC CAG CCG GAG TGG TTT GGA AAT CCT ACA CAA GAA GCA TTG CCT 1298 
Asn Ser Gin Pro Glu Trp Phe Gly Asn Pro Thr Gin Glu Ala Leu Pro 
415 420 425 

TCC ACC GTC CTT TCA AAG CGC GAG ATG CTA GAC AGA TAC GAG CAG CAC 1346 
Ser Thr Val Leu Ser Lys Arg Glu Met Leu Asp Arg Tyr Glu Gin His 
430 435 440 

ACG TTG AAA TGC TCG TCC TGC AAA GGA GCA TAT AAT GCA TTC CAG AAT 13 94 

Thr Leu Lys Cys Ser Ser Cys Lys Gly Ala Tyr Asn Ala Phe Gin Asn 
445 450 455 460 

CTG CAG AAG GTA TTC ATG GGA GCG ACA GTA GTT TGC TGT GCT GCC GCT 1442 
Leu Gin Lys Val Phe Met Gly Ala Thr Val Val Cys Cys Ala Ala Ala 
465 470 475 
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GGT ATT CCT CCA GAT GTT CAG CTC AGG TTA TTG ATC GGT GCG GCT GCT 1490 
Gly He Pro Pro Asp Val Gin Leu Arg Leu Leu He Gly Ala Ala Ala 
480 485 490 

TTG GTC AGT GCC GCT GTA GCA TAC GCA TTC CAT GAG CTC CAG AAG AAT 153 8 

Leu Val Ser Ala Ala Val Ala Tyr Ala Phe His Glu Leu Gin Lys Asn 
495 500 505 

TTT GTA TTC GTG GAT TAC GTG CAT GCT GAC ATT GAT TGAAAGATTC 1584 
Phe Val Phe Val Asp Tyr Val His Ala Asp He Asp 
510 515 520 

GTGAGGATCT GTTGTGCGAC ATCACTGGCT CGCGAGTCGT GTCTGTAGTC TAGGGCTCTA 1644 

GGCGTCTAGC TAGGGAAAGT AACTTTTTGC CGGGTATAGG TCATATTGCT CACATATGTA 1704 

TTTTGTATAG TGTATGCACT CAACTGTAGC CGATTCAGTG CGAAAATATA GTTTTTATGT 1764 

TACTATCTAT TGGATTAAAA TTGTCTCCAG ATCCTTTTAG CATGTAAAAT GCCATTTTTC 1824 

AAATGGAAGT TCTCAATTGC GCCCCTAGAC T 1855 



{2} INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Arg Ala Thr He Pro Ala Leu Ser Leu Leu Val Thr Pro Arg Leu 
15 10 15 

Pro Ser Leu Ala Val Pro Leu Ala Gly Gly Arg Leu Arg Glu Gly Gly 
20 25 30 

Arg Ser Arg Thr Arg Leu Arg Val Ala Ala Pro Thr Ser Val Pro Gly 
35 40 45 

Glu Ala Ala Glu Gin Ala Glu Pro Ser Thr Ser Ala Pro Glu Ser Gly 
50 55 60 

Glu Lys Phe Ser Trp Arg Asp His Trp Tyr Pro Val Ser Leu Val Glu 
65 70 75 80 

Asp Leu Asp Pro Ser Arg Pro Thr Pro Phe Gin Leu Leu Asn Arg Asp 
85 90 95 

Leu Val He Trp Lys Glu Pro Lys Ser Gly Glu Trp Val Ala Leu Asp 
100 105 HO 

Asp Arg Cys Pro His Arg Leu Ala Pro Leu Ser Glu Gly Arg He Asp 
115 120 125 

Glu Thr Gly Cys Leu Gin Cys Ser Tyr His Gly Trp Ser Phe Asp Gly 
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130 135 140 

Ser Gly Ala Cys Thr Lys He Pro Gin Ala Met Pro Glu Gly Pro Glu 
145 150 155 i6o 

Ala Arg Ala Val Arg Ser Pro Lys Ala Cys Ala He Lys Phe Pro Thr 
165 170 J 175 

Leu Val Ser Gin Gly Leu Leu Phe Val Trp Pro Asp Glu Asn Gly Trp 
180 185 " 190 

Glu Lys Ala Ala Ala Thr Lys Pro Pro Met Leu Pro Lys Glu Phe Glu 
195 200 205 

Asp Pro Ala Phe Ser Thr Val Thr He Gin Arg Asp Leu Phe Tyr Gly 
210 215 220 

Tyr Asp Thr Leu Met Glu Asn Val Ser Asp Pro Ser His He Glu Phe 
225 230 235 2 40 

Ala His His Lys Val Thr Gly Arg Arg Asp Arg Ala Arg Pro Leu Thr 
245 250 255 

Phe Arg Met Glu Ser Ser Gly Ala Trp Gly Tyr Ser Gly Ala Asn Ser 
260 265 270 

Gly Asn Pro Arg He Thr Ala Thr Phe Glu Ala Pro Cys Tyr Ala Leu 
275 280 285 

Asn Lys lie Glu lie Asp Thr Lys Leu Pro lie Phe Gly Asp Gin Lys 
290 295 300 

Trp Val lie Trp He Cys Ser Phe Asn He Pro Met Ala Pro Gly Lys 
3 05 310 315 320 

Thr Arg Ser lie Val Cys Ser Ala Arg Asn Phe Phe Gin Phe Thr Met 
325 330 335 

Pro Gly Lys Ala Trp Trp Gin Leu Val Pro Arg Trp Tyr Glu His Trp 
340 345 ' " 350 

Thr Ser Asn Leu Val Tyr Asp Gly Asp Met lie Val Leu Gin Gly Gin 
355 360 365 

Glu Lys lie Phe Leu Ala Ala Thr Lys Glu Ser Ser Thr Asp He Asn 
370 375 380 

Gin Gin Tyr Thr Lys lie Thr Phe Thr Pro Thr Gin Ala Asp Arg Phe 
385 390 395 " 400 

Val Leu Ala Phe Arg Thr Trp Leu Arg Lys Phe Gly Asn Ser Gin Pro 
405 410 415 

Glu Trp Phe Gly Asn" Pro Thr Gin Glu Ala Leu Pro Ser Thr Val Leu 
420 425 430 

Ser Lys Arg Glu Met Leu Asp Arg Tyr Glu Gin His Thr Leu Lys Cys 
435 440 445 

Ser Ser Cys Lys Gly Ala Tyr Asn Ala Phe Gin Asn Leu Gin Lys Val 
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450 455 460 

Phe Met Gly Ala Thr Val Val Cys Cys Ala Ala Ala Gly lie Pro Pro 
465 470 475 480 

Asp Val Gin Leu Arg Leu Leu He Gly Ala Ala Ala Leu Val Ser Ala 
485 490 495 

Ala Val Ala Tyr Ala Phe His Glu Leu Gin Lys Asn Phe Val Phe Val 
500 505 510 

Asp Tyr Val His Ala Asp He Asp 
515 520 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 822 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GCAACGCACA CAGACAGGCA GCGATGTCTT TCGCGGGTCA GTAAACCTCA CTCACACAGG 60 

CTATTCGTCT TAAGTTTTTT TGTTCAACAT CACATACTTG TGTTGCTAAT GTAACAAAAA 120 

AAATTCACAC GCCTCACAAA CATTACAATA TGATTCAAAA TAGACACTAA CCAAACCTTG 180 

GAGGACTTTG TACTGGCTAG AGAACACCTA CTCTACTGCT ATGCTGCTTA CCCGAGACAG 240 

AGGAAATACA CACGAGCAAC TGTTGTGGAC TTGTTGCAAA ATAGCAAGGA AAGGTATTAG 300 

TAATAGCAAG CATAATTGTA GGAGCTGCAA GTATAACAAT GATAGTCTGC TCTTTAGTAC 3 60 

CTTACATGTA TGAAATAAAA AACTATATAG GTAAAGTGAA CAACATGCGT TATGTAAATC 420 

TAGCAGACTA TTGGATTGAA AAGAATTCAA TTACAAGGAC AAAGAATGAC TGACGAGGGC 480 

AGCAACACAA TAACTAAATG TTCCAAAATG GTCAGATATG AAGGGCTCGA ACGCATGCAC 540 

GGCATGATAT GCTAGTTGGG GCCGTTTCCG TCGGGCTTTA AAGATAAGGA AATCTGGATA 600 

TGGACTAATG ATGTCTAATT TTTGTTAGAG CCTAGCGCCC TAGCATGCTA ACTAGAAGGT 660 

TAATTTTGTT TCTATTTTTT GTTGCACCGA CTGAGCCAAC ATTCTTTTGT CTAGTAGTTT 720 

ACATTTTAGT TACTACTCTC TTCGTCTAAA AAGTACTATA TCTCCATTTT TTAAAATGTC 780 

TTGCTTTTTG AAGAGCACTA TCTTTTAAAA TCTTGACCAA CTATATAAAA GTACTTCTGA 840 

TACATGATAG GTTTAATAAA ATATATAAAA TCTTATATTT TTAGTAAGTC TAGTCAAACT 900 

TAAGAGCTTT TGATGTCGCA CATAGTTGTT TTAAACAAGG TGTTTGTTCA TGTTCGTTCT 960 
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AATATGTGGA TAGTATTCCG ATTCATTTCG CCAGAGGTGT GGCTGTGGAT ATTTGGTTAG 1020 

AGCATCTTCA AGAAAACCCG TAAATCAACT CCAAAAACGT TTTGAGCCTC CCAACAGTCC 1080 

CCCTTCCCCT CCCCATATTA CGCGTCAAGC ATTGTTCCCA ATCGTCCTCT GCGCATGCTG 114 0 

GTTCCCACGT GTATTTTCCT CGCGCGCAGT TCTGTTGGAG GAGGAAGGCG GGACGTTGGC 1200 

ACTAGCGCTG GCTGGAGATT ATGGCCATCG CAATCAGTTT GTGGCAGTCA AATGCTTTGT 1260 

TTTTTTGGCC GCTCATGTGA GTATCATTTC TGTGAAAACT ATCTAAATCA ATATGAATGT 1320 

ATATTTCTTT AAGTCGTCAC GATAGGAAGA CTCCATCGTT CTAAAACCTA AACCATGCAC 1380 

ACATATTCAT CTTTCTCCAA ACGCAAGTCT CGTGATATTT ATATTCTCGT GCCAGCTAGA 1440 

TTATCTAGAA ATTTAGATTC TTAAAAAAAT TCTTTAGAAA AAAAATTATA CCAAACAGGA 1500 

CCATGGTTTA AACTATTACG GATAAATAGC ATGACTACCT TAGTATTTAA ATGATATCAG 1560 

TTGAAATATG TCGACTTATT TTATAGTTAG TATTATTAGA ACATGTTTAA ATAATTATCA 1620 

CATTTAAACC AGATCTACAT ATAAACTATT TTGCTTGTCA ACTGCATCGC AAACTCACTT 1680 

GCCTACCATC GGGATCGCGC TCGTATACAA GTGACACACT TTAAATGATT TAAGCCGCGA 1740 

AAATTATAAA TGTACCATCC TCATTTGGCA AGTCTAAAGA TAGCTTTACC ATACAAATGA 1800 

AACTAAATTT AAAATTCCAA GTAATAATTA GAAAAACTGA TTTGACAGTT TTTTCAGTAT 1860 

ATATTTAGCA GCTCGCTAAA TCTGAATTTA GAAAGTTTTT TTGAAATGAG TTGAGATGCT 1920 

CTTATAATGG TTACTATAGG TTGAGGGACG GAAGTAGTAG TAGAACTGGT AAACAAATTC 1980 

GAATTTGATC TATTCAACTT TGTAGCTACT CAGCAAGATG CGAATTGCAA ACATCCGGCG 2040 

GGGTGGATTC CGCCACGGCC CACGGGTGGG TTCGTGTCGT TCTCACCGCC GGTCAATCTC 2100 

CCCTCCGCGC GGCGCAATTC GTCCCGGTGG GGACGGCTAG CTGGCCCAAT GCCAAAGCTC 2160 

CACCGACAAA TGCCGCAAAG CGCCATGCGT GGTCGCGTAC AATTGCCTCC TTCCCCGCCC 2220 

TTCCTCCCTT CCCTGCCGTG ACGCAACCAC ACTGCGCTCA CCATCGTGTA CAATGTATTC 22 80 

TCCCTAGCCG AACCGTATCA GTAGTTCTTA GGGGTGGGCG TTCGGGTTAC CCGAAATTTT 234 0 

CGGGTTGGGT AATTCAAGTT TTTTAAATTT CGGGTTTTGA GAATCAATAC CCGAAATTAC 2400 

AACGGATTTT TCAATACCCG GAATTTCGGG TACCCGGAAT TTCGGGTTCG GGTTCGGGTA 2460 

TTCCCAAACT ACCCGAACTA TTGTGTTGGC TTCATAAAAA CACATACACC CTATTAAATT 252 0 

AGTATAAAAA TATAGTTTGA ATAATGATAT ACATGGACAT ATAAAACACA AACAATCTAC 2580 

AATCCCAAGT TATGCACACT TACACATAAT TATAGATGTA CAAACTTAAA TTATTAAAGC 264 0 

ATGACATGAG TACATGACAC ATGAAAGCCG GGTAATTCGG GTATTTCGGG TACCCGATTG 2700 

TGATACCCGA ATTACCCGAA ATAATTTCGG GTTTTGCAAG TTGCTACCCG AAATTCCCAA 2760 
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ACAAAATTCG GGTTTCGGGT ATTTCGGGTT CGGGTTCGGG TATTCCAGGT TTGGGTTTCG 2 820 
GG 2822 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4015 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

TTACGGGTTT TTTGCCCAGC CCTACTAGTT CTTCCCTCGC GTTCACTCCC CAGCGTGGGA 60 

AAATCCCGGA ATTTTCTTGT TTGTCCACTG GTTTTCTTGC GCCAAAACCA GGTTTCTCCC 120 

CGTTGCCGTG GCAGAACTCT GTTCTTGCCC AGTCTAGAAG ATCTGCACCG TTCCAACCAC 180 

CGACTCCGGC CGCCAAGCAT ATAGCCAGCG CGGCGAAGAA TTCCCAACGC GAAAGCCAAA 24 0 

ACCTCTTCAC TTCACTTCAC GTCGACACGT GCGGGGAGAA TATGCGCGCG ACAATCCCAG 300 

CCCTGTCGCT CCTGGTGACG CCGCGGCTCC CCTCGCTCGC CGTGCCGCTG GCTGGAGGCC 360 

GCCTCCGCGA GGGCGGTCGT TCTCGGACCC GCCTCCGCGT GGCGGCGCCG ACGTCCGTAC 420 

CAGGGGAAGC GGCGGAGCAG GCGGAGCCGA GCACGTCGGC GCCCGAGTCC GGCGAGAAGT 4 80 

TCTCGTGGAG GGATCACTGG TACCCGGTCT CCCTCGTCGA GGACCTCGAC CCCAGCCGCC 540 

CCACCCCGTT CCAGCTCCTC AACCGCGACC TCGTCATCTG GAAGGAACCC AAGTCCGGCG 600 

AGTGGGTCGC GCTCGACGAC CGCTGCCCCC ACCGCCTTGC CCCGCTCTCG GTACGGCGAC 660 

CCGCATCCCT TCCTCGCCTC ATCCGTGTCC TACCGGATCT CTTCCTCGTT TCGGCTAATT 720 

TTGGTCTGGG CATGTGCAGG AGGGCAGGAT CGATGAGACG GGGTGCTTGC AGTGCTCGTA 780 

TCACGGATGG TCATTCGATG GCTCCGGCGC CTGCACCAAG ATCCCCCAGG CCATGCCCGA 840 

GGGTCCTGAG GCCCGWGCGG TGCGGTCACC GAAGGCGTGC GCGATCAAGT TCCCCACCCT 900 

CGTCTCCCAG GGGCTGCTCT TCGTGTGGCC CGATGAGAAT GGGTGGGAGA AAGCGGCCGC 960 

CACCAAGCCT CCAATGTGCG TAGAGTCAGA CTTTGGACTG CGGCTAATTG GTTGGATTCA 102 0 

GTTTTGCATT TCGGTGTCTG AATTCGATCT TATTTGGTTT CAGGTTGCCG AAAGAATTTG 1080 

AGGACCCGGC CTTCTCCACG GTGACAATCC AGAGGGACTT GTTCTATGGT TATGATACGT 1140 

TGATGGAGAA CGTCTCTGAT CCGTCCCATA TAGAATTTGC TCACCACAAG GTACTTGGTA 1200 

CAGTGAGAAA GCTTAGTTGC TTGCCACACT TAAGCACCAT GATAGTATTT TTCAGTTGAA 1260 
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AGTTGGTGAT TCGAGGAAAG ATGTTTTGTT GCAACCAATT TGTGTAGTTT GCTAAAAAAT 1320 

CACCTCCTCA ATACTGTTTA ATTGTGTAGG CCTCTTATCG TTTCTGATTG CCAGTGTGCA 13 80 

AGTTTAACTA ACTGTTAGAT CTTAACTGTG GATGTACCCA TATATTTTTT TTGCATCATA 1440 

GTTTTATTCT TTTTTACTTA TGCTGCATTG AAATTCCTCA GAAATGACTT ATAATGGGCA 150 0 

AAAGGGCTGA ATGGCTGAGT CTGGCCTCTT ATCGTTTCTA GATTGCCAGC GTGCAAGTTT 1560 

AACTAAGGTC CCGTTTGGTT TGAGGGATTA AATATCAGTG CCTCCATTTT AGTCCCATTT 1620 

AGTCCATAAA TTGACAAACG GTGGGACTAA AACAAGGACT AAACTGTTCT AGTCTCTAGT 1680 

CCCTCAAGGG ATGACTCTAA GGGGCTAAAC CATAAAAATC CACTTTTTGG CCCTCCTTCA 174 0 

TTTCAGTTGC ACTAATGGCG GGAGGATGTT AAGGAGTATT TTGGTCTTCT TATGATTCAT 1800 

TTAATGTGTT TTGAATACTT ATAGTTTTTA GAACCAAACA GGGAGGGACT AAATTTTAGT 1860 

CTTCTAACTA AACTTTCGTC CCTGGACTAA AGGAACCAAA CCCTAACTGT TAGATCTTAA 192 0 

CTGTGGATGC ACCCATATAT ATTTTTGCAT CATAGTTTTA GTTCTTTTTT ACTTACGCTA 1980 

CTTGCTTAGT CTGAACAGGC ATTAATAGGG TGTTTGGTTT GAGGGATTAG TTAGTTCACC 2040 

CACTCATTCC TCTTTTCTTT GTTTGGTTTG TTGAATGGAG TAGGTTGGTC AGTGCATTAT 2100 

CACATCATTC CTCAGACTAG TAGTTAGTAC TAGTATGAAG AATGGGGTCA TTCAACCAAA 2160 

TTTAAGGAAT TGACTCATGA TGCATCACCA CATTTAGAAT GGAGTGGCTC CTCAAACCAA 2220 

ACCCTATAAA TGACTGGCTG AGTTAATTGT GCTATCTGTG TGTCATGAAC TTGTGCCGGC 2280 

AGCATAGACA AACAAAATGC TTTATTTTCT CGGGATACAT GGTTTCAGCA AATCCACTCA 2340 

TGTTTCAGAT TTTAACTCTT CACAGGTTAC TGGACGAAGA GATAGAGCCA GGCCTTTGAC 2400 

ATTCAGGATG GAATCAAGTG GTGCCTGGGG TTACTCAGGA GCAAATTCTG GTAATCCTCG 2460 

CATTACTGCA ACTTTTGAGG CCCCTTGTTA TGCATTAAAC AAGTAAGTTT CAGAAAAGTA 252 0 

CCTGGTCATC TTTGAGTGTG GAGTGATTCT TATTTACCAC TTAAGCAATT CAGTCGTTAT 2580 

ACGGTTCTGA ACTTCTGTTA ACTGGCTTGT ACAGAATAGA GATAGACACA AAGTTACCCA 2640 

TTTTTGGCGA CCAGAAATGG GTCATATGGA TTTGCTCTTT CAACATTCCA ATGGCCCCAG 2700 

GGAAGACTCG TTCTATTGTC TGTAGCGCTC GAAACTTTTT CCAGTTCACA ATGCCAGGAA 2760 

AAGCATGGTG GCAGGTACAT GTGTGTTTAG TGTTTCCTTT ACTTAAGCTT TGTTTTCCTA 282 0 

TTTGTTTTGT CAACATAATC TTTTAACTGC TAAAACGAAC TTGTTCTCGC GTTTTTGTGG 2 880 

GAAACAAGGC AAAGGTCCCT AGTCCCTACT GTAGGCATAT ATTATTGGCA GAGTTTATTA 294 0 

CTTGGTCATG TTTGAATTTA TATGTGTACA GTCAAATGTT GATAGCTTCT TTCTCTTGGT 3000 

GTAGCTTGTT CCTCGATGGT ATGAACATTG GACTTCAAAT TTGGTCTATG ATGGCGATAT 3060 
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GATCGTTCTT 


CAAGGCCAGG 


AGAAGATTTT 


CCTAGCTGCA 


ACCAAGGAGT CTTCTACGGA 


3120 


TATTAATCAG 


CAGTACACAA 


AGATCACATT 


CACGCCCACA 


CAAGCTGATC GATTTGTTTT 


3180 


AGCATGCCGC 


ACGTGGCTAA GGAAATTTGG 


CAATAGCCAG 


CCGGAGTGGT TTGGAAATCC 


3240 


TACACAAGAA 


GCATTGCCTT 


CCACCGTCCT 


TTCAAAGCGC 


GAGGTAAAAG CCATCTGGGT 


3300 


CACCAAAAAA 


GTTTCAGTAT 


AATATTTGCT 


TCAGACATAA AATATCTGAA TATGACAACC 


3360 


TTTTTGGTGG 


TCAAAGATCT 


GTTTTGCTTA 


CATTCTTAAT ACTCGATGCA TTGGTAAGTT 


3420 


ATTACAGTTA 


TCCTTTTTAC 


TCGATTTTTC 


CCTTTCTGAG 


CAGAACTATT ATCACGTCTT 


3480 


CATTGTTTGT 


ACACTTGGTT 


TCTATGACAC 


ACAAATTTTT 


ATTTTACATT ATCAGTTGTC 


3540 


ATATGAACTA 


ATGTATTTAC 


AGCAACCTGC 


TTAAGTGCTT 


AGTATCACAA AGGGACAAAT 


3600 


TCAATGAAAT 


ATTTGGAAAG 


ATAGTAGCGT 


CGAACCACTC 


TCACAGCTAG GCATTTGAGA 


3660 


ATAGTTACTT 


AACTGACAGC 


GAAGTTCACC 


TTCTACCGAC 


TGGATCTGGA AACAGTATCT 


3720 


TGAAGTAGTT 


CACACGTAAA 


CCTTCATCAG 


CTGTGTTTCT 


GGCTTCCAGT AACTCATGTA 


3780 


TTCTTATGAT 


TGACTTTGTG 


TTATGCAGAT 


GCTAGACAGA 


TACGAGCAGC TCTCGTTGAA 


3840 


ATGCTCGTCT 


TGCAAAGGAG 


CATATAATGC 


TTTCCAGAAT 


CTGCAGAAGG TATTCATGGG 


3900 


AGCGACAGTA 


GTTTGCTGTG 


CTGCCGCTGG 


TATTCCTCCA 


GATGTTCAGC TCAGGCTATT 


3960 


GATCGGTGCG 


GCTGCTTTGG 


TCAGTGCCGC 


TATAGCATAC 


GCATTCCATG AGCTC 


4015 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Pro lie Gin Lys Asp Ser Leu Phe lie Ser His His Lys He Pro He 
1 5 10 15 

Lys Gly Leu Asn Phe Ser He Lys He Glu Thr Phe Pro Gin Pro Phe 
20 25 30 

Thr Arg Gly Gly Ala Ala Val Leu Tyr Pro Leu Arg He Arg Arg Arg 
35 40 45 

Arg Ser Gly Ser Lys Lys Asn Thr Gly Gly Asp Lys Glu Glu Glu Gly 
50 55 60 

Ser Glu Phe Lys Trp Arg Asp His Trp Tyr Pro Val Ser Leu Val Glu 
65 70 75 80 
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Asp Leu Val Pro Asn Val Pro Thr Pro Phe Gin Leu Leu Gly Arg Asp 
85 90 95 

Leu Val Leu Trp Phe Asp Arg Asn Asp Gin Lys Trp Ala Ala Leu Phe 
100 105 110 

Tyr Gly Tyr Asp Thr Leu Met Glu Asn Val Ser Asp Pro Ser His lie 
115 120 125 

Asp Phe Ala His His Lys Val Thr Gly Arg Arg Asp Arg Ala Lys Pro 
130 135 14 0 

Leu Pro Phe Lys Val Glu Ser Ser Gly Pro Trp Gly Phe Gin Gly Ala 
145 150 155 160 

Asn Asp Asp Ser Pro Arg lie Thr Ala Lys Val Ala Pro Cys Tyr Ser 
165 170 " 175 

Met Asn Lys He Glu Leu Asp Ala Lys Leu Pro He Val Gly Asn Gin 
180 185 190 

Lys Trp Val He Trp He Cys Ser Phe Asn He Pro Met Ala Pro Gly 
195 200 205 

Lys Thr Arg Ser He Val Cys Ser Ala Arg Asn Phe Asp Asp Leu Cys 
210 215 220 

Pro His Arg Leu Ala Pro Leu Ser Glu Gly Arg Leu Asp Glu Asn Gly 
225 230 235 240 

His Leu Gin Cys Ser Tyr His Gly Trp Ser Phe Gly Gly Cys Gly Ser 
245 250 * 255 

Cys Thr Arg He Pro Gin Ala Ala Thr Ser Gly Pro Glu Ala Arg Ala 
260 265 270 

Val Lys Ser Pro Arg Ala Cys Ala He Lys Phe Pro Thr Met Val Ser 
275 280 285 

Gin Gly Leu Leu Phe Val Trp Pro Asp Glu Asn Gly Trp Asp Arg Ala 
290 295 300 

Asn Ser He Glu Pro Pro Arg Leu Pro Asp Asp Phe Asp Lys Pro Glu 
305 310 315 320 

Phe Ser Thr Val Thr He Gin Arg Asp Phe Phe Gin Phe Ser Val Pro 
325 330 335 

Gly Pro Ala Trp Trp Gin Val Pro Arg Trp Tyr Glu His Trp Thr Ser 
340 345 350 

Asn Leu Val Tyr Asp Gly Asp Met lie Val Leu Gin Gly Gin Glu Lys 
355 360 365 

Val Phe Leu Ala Lys Ser Met Glu Ser Pro Asp Tyr Asp Val Asn Lys 
370 375 380 



Gin Tyr Thr Lys Leu Thr Phe Thr Pro Thr Gin Ala Asp Arg Phe Val 
385 390 395 400 
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Leu Ala Phe Arg Asn Trp Leu Arg Arg His Gly Lys Ser Gin Pro Glu 
405 410 415 

Trp Phe Gly Ser Thr Pro Ser Asn Gin Pro Leu Pro Ser Thr Val Leu 
420 425 430 

Thr Lys Arg Gin Met Leu Asp Arg Phe Asp Gin His Thr Gin Val Cys 
435 440 445 

Ser Ser Cys Lys Gly Ala Tyr Asn Ser Phe Gin lie Leu Lys Lys Phe 
450 455 460 

Leu Val Gly Ala Thr Val Phe Trp Ala Ala Thr Ala Gly Val Pro Ser 
465 470 475 480 

Asp Val Gin He Arg Leu Val Leu Ala Gly Leu Ser Leu He Ser Ala 
485 490 495 

Ala Ser Ala Tyr Ala Leu His Glu Gin Glu Lys Asn Phe Val Phe Arg 
500 505 510 

Asp Tyr Val His Ser Glu He Glu 
515 520 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Cys Xaa His Xaa Cys Xaa His 
1 5 

(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) S TRANDEDNE S S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Glu Xaa Asp Xaa His Xaa His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 8: 



57 



SUBSTITUTE SHEET (RULE 26) 



WO 98/39422 



PCT/US98/04040 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
<C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Asn Gin Cys His His Arg Gly Met Lys Leu Ser Arg Asp Asp Ala Gly 
15 10 15 

Asn Ala Lys Ala Pro Val Cys Thr Tyr His Gly Trp Ala His Asp He 
20 25 30 

Ser Gly Gin 
35 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Asn Gin Cys Arg His Arg Gly Met Arg He Cys Arg Ser Asp Ala Gly 
15 10 15 

Asn Ala Lys Ala Pro Thr Cys Ser Tyr His Gly Trp Ala Tyr Asp He 
20 25 30 

Ala Gly Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asn Gin Cys Arg His Arg Gly Met Arg He Cys Arg Ser Asp Ala Gly 
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15 10 15 

Asn Ala Lys Ala Pro Thr Cys Thr Tyr His Gly Trp Ala Tyr Asp lie 
20 25 30 

Ala Gly Asn 
35 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Asn Gin Cys Arg His Arg Gly Met Arg lie Val Arg Ser Asp Gly Gly 
1 5 10 15 

Asn Ala Lys Ala Pro Thr Cys Thr Tyr His Gly Trp Ala Tyr Asp lie 
20 25 30 

Ala Gly Asn 
35 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Asn Gin Cys Arg His Arg Gly Met Arg lie Cys Arg Ala Asp Ala Gly 
1 5 10 IS 

Asn Ala Lys Ala Pro Thr Cys Ser Tyr His Gly Trp Ala Tyr Asp Thr 
20 25 30 

Ala Gly Asn 
35 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Asn Gin Cys Arg His Arg Gly Met Arg lie Cys Arg Ser Asp Ala Gly 

Asn Ala Lys Ala Pro Thr Cys Ser Tyr His Gly Trp Ala Tyr Asp Thr 
20 25 30 

Ala Gly Asn 
35 

(2) INFORMATION FOR SEQ ID NO: 14: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Asn Gin Cys Arg His Arg Gly Met Arg He Cys Arg Ala Asp Gly Gly 
1 5 .10 15 

Asn Ala Lys Ser Pro Thr Cys Ser Tyr His Gly Trp Ala Tyr Asp Ser 
20 25 30 

Ala Gly Asn 
35 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 

Asn Gin Cys Arg His Arg Gly Met Arg He Cys Arg Ala Asp Gly Gly 
1 5 io 15 
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Asn Ala Lys Ser Pro Thr Cys Ser Tyr His Gly Trp Ala Tyr Asp Thr 
20 25 30 

Gly Gly Asn 
35 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Asn Ala Cys Ser His Arg Gly Ala Gin Leu Leu Gly His Lys Arg Gly 
1 5 10 i 5 

Asn Lys Thr Thr Tyr Thr Cys Pro Phe His Gly Trp Thr Phe Asn Asn 
20 25 30 

Ser Gly Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Asn Ala Cys Ser His Arg Gly Ala Thr Leu Cys Arg Phe Arg Ser Gly 
1 5 io is 

Asn Lys Ala Thr His Thr Cys Ser Phe His Gly Trp Thr Phe Ser Asn 
20 25 * 30 

Ser Gly Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Asn Ser Cys Arg His Arg Gly Ala Leu Leu Cys Pro Phe Ser Lys Gly 
1 5 10 15 

Asn Gin Lys Phe His Val Cys Arg Tyr His Gly Trp Ser Tyr Asp Ser 
20 25 30 

Ser Gly Arg 
35 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Asn Val Cys Arg His Arg Gly Lys Thr Leu Val Ser Val Glu Ala Gly 
1 5 10 15 

Asn Ala Lys Gly Pro Val Cys Ser Tyr His Gly Trp Gly Phe Gly Ser 
20 25 30 

Asn Gly Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Asn Val Cys Arg His Arg Gly Lys Thr Leu Val Asn Ala Glu Ala Gly 
1 5 10 15 

Asn Ala Lys Gly Pro Val Cys Gly Tyr His Gly Trp Gly Phe Gly Ser 
20 .25 30 
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Asn Gly Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Asn Val Cys Arg His Arg Gly Lys Thr lie Val Asp Ala Glu Ala Gly 
15 10 15 

Asn Ala Lys Gly Pro Val Cys Gly Tyr His Gly Trp Gly Tyr Gly Ser 
20 25 A 30 

Asn Gly Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Ser Arg Cys Pro His Arg Gly Val Ser Leu Phe Met Gly Arg Val Lys 
1 5 ' 10 15 

Lys Gly Gly Leu Arg Cys Val Tyr His Gly Trp Lys Phe Ser Ala Glu 
20 25 30 

Gly Lys 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Ser Arg Cys Pro His Arg Gly Val Ser Leu Phe Met Gly Arg Val Lys 
15 10 15 

Lys Gly Gly Leu Arg Cys Val Tyr His Gly Trp Lys Phe Ser Ala Glu 
20 25 30 

Gly Lys 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Lys Tyr Cys Pro His Arg Arg Val Ser Leu lie Tyr Gly Arg Asn Lys 
15 10 15 

Asn Ser Gly Leu Arg Cys Leu Tyr His Gly Trp Lys Met Asp Val Asp 
20 25 30 

Gly Asn 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:25: 

Pro Arg Cys Met His Arg Gly Thr Ser Leu Tyr Tyr Gly His Val Lys 
15 10 15 

Lys Ala Gly lie Arg Cys Cys Tyr His Gly Trp Leu Phe Ala Val Asp 
20 25 30 

Gly Thr 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Asp Phe Cys Pro His Arg Gly Ala Pro Leu Ser Leu Gly Ser lie Gin 
15 10 15 

Asp Gly Lys Leu Val Cys Gly Tyr His Gly Leu Val Met Asp Cys Asp 
20 25 30 

Gly Arg 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Gly Tyr Cys Arg His Met Gly Gly Asp Leu Ser Glu Gly Thr Val Lys 
15 10 15 

Gly Asp Glu Val Ala Cys Pro Phe His Asp Trp Arg Trp Gly Gly Asp 
20 25 30 

Gly Arg 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Asp Arg Cys Pro His Arg Leu Ala Pro Leu Ser Glu Gly Arg He Asp 
1 5 10 15 

Glu Thr Gly Cys Leu Gin Cys Ser Tyr His Gly Trp Ser Phe Asr> Glv 
20 25 ^ 30 

Ser Gly Ala 
35 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Asp Leu Cys Pro His Arg Leu Ala Pro Leu Ser Glu Gly Arg Leu Asp 
1 5 10 is 

Glu Asn Gly His Leu Gin Cys Ser Tyr His Gly Trp Ser Phe Gly Gly 
20 25 30 

Cys Gly Ser 
35 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:30: 

Asp Gin Cys Pro His Arg Leu Ala Pro Leu Ser Glu Gly Arg He Asn 
1 5 10 15 

Lys Ala Gly Gin Leu Glu Cys Pro Tyr His Gly Trp Thr Phe Ala Gly 
20 25 30 

Ser Gly Gin 
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35 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Ser Thr Cys Ala His Arg Ala Cys Pro Leu Asp Leu Gly Thr Val Asn 
15 10 15 

Glu Gly Arg lie Gin Cys Pro Tyr His Gly Trp Glu Tyr Ser Thr Asp 
20 25 30 

Gly Asn 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Asn Thr Cys Ala His Arg Ala Cys Pro Leu His Leu Gly Ser Val Asn 
15 10 15 

Glu Gly Arg lie Gin Cys Pro Tyr His Gly Trp Glu Tyr Ser Thr Asp 
20 25 30 

Gly Lys 



(2) INFORMATION FOR SEQ ID NO: 33: 

fi) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Ala Ala Glu Gin Phe Cys Ser Asp Met Tyr His Ala Ala Thr Met Ser 
1 5 io 15 

His Leu 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Ala Ala Glu Gin Phe Cys Ser Asp Met Tyr His Ala Gly Thr Thr Thr 
1 5 10 is 

His Leu 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Ala Ala Glu Gin Phe Cys Ser Asp Met Tyr His Ala Gly Thr Met Ala 
15 10 is 

His Leu 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Ala Ala Glu Gin Phe Cys Ser Asp Met Tyr His Ala Gly Thr Met Ser 
15 10 15 

His Leu 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Ala Ala Glu Gin Phe Cys Ser Asp Met Tyr His Ala Gly Thr Thr Ser 
1 5 " 10 15 

His Leu 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Ala Ala Glu Gin Phe Cys Ser Asp Ala Tyr His Ala Gly Thr Thr Ser 

1-5 10 15 

His Leu 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) 'MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Ala Ala Glu Gin Phe Cys Ser Asp Met Tyr His Val Gly Thr Thr Ser 
1 5 io 15 

His Leu 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Ala Ala Glu Gin Phe Cys Ser Asp Met Tyr His Ala Gly Thr Thr Ser 
1 5 10 is 

His Leu 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Thr Ala Glu Asn Gly Ala Asp Gly Tyr His Val Ser Ala Val His Trp 

(2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Gin Val Glu Asn Cys Ala Asp Gly Tyr His Val Ser Thr Val His Trp 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Gin Phe Glu Asn Gly Leu Asp Phe Tyr His Phe Gly Ser Thr His Ser 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His Val Gly Trp Thr His 
1.5 10 15 

Ala 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Pro Ala Glu Asn Phe Val Gly Asp Ala Tyr His Val Gly Trp Thr His 
15 10 " 15 

Ala 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Pro Ala Glu Asn Phe Val Gly Asp He Tyr His He Gly Trp Thr His 
1 5 10 15 

Ala 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

Gin He Glu Asn Gly Ala Asp Gly Tyr His Val Gly Ser Val His Trp 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 48: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Asn Leu Glu Gly Lys lie Asp Thr Ser His Phe Asn Pro Leu His Val 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

lie Leu Glu Gly Ala lie Asp Ser Ala His Ser Ser Ser Leu His Ser 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Asn Trp Glu Asn lie Met Asp Pro Tyr His Val Tyr lie Leu His Ser 

1.5 10 15 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met lie Asp Asn Leu Met Asp Leu Thr His Glu Thr Tyr Val His Ala 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

lie lie Asp Asn Val Thr Asp Met Ala His Phe Phe Tyr lie His Phe 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Leu Met Glu Asn Val Ser Asp Pro Ser His He Glu Phe Ala His His 
l" 5 10 15 

Lys 



(2) INFORMATION FOR" SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 

Leu Met Glu Asn Val Ser Asp Pro Ser His He Asp Phe Ala His His 
15 10 15 

Lys 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Leu Met Glu Asn Val Leu Asp Ser Ser His He Pro Tyr Thr His His 
1 5 10 * 15 

Lys 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Asp Arg Cys Pro His Arg Leu Ala Pro Leu Ser Glu Gly Arg He Asp 
15 10 15 

Glu Thr Gly Cys Leu Gin Cys Ser Tyr His Gly Trp Ser Pro Asp Gly 
20 25 30 

Ser Gly Ala 
35 

(2) INFORMATION FOR SEQ ID NO: 57: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Asp Leu Cys Pro His Arg Leu Ala Pro Leu Ser Glu Gly Arg Leu Asp 
1 5 10 15 

Glu Asn Gly His Leu Gin Cys Ser Tyr His Gly Trp Ser Pro Gly Gly 
20 25 3 0 

Cys Gly Ser 
35 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

Asp Gin Cys Pro His Arg Leu Ala Pro Leu Ser Glu Gly Arg He Asn 
1 5 io 15 

Lys Ala Gly Gin Leu Lys Cys Pro Tyr His Gly Trp Thr Pro Ala Gly 
20 25 " 30 

Ser Gly Gin 
35 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Leu Met Glu Asn Val Ser Asp Pro Ser His lie Glu Pro Ala His His 
15 10 15 

Arg 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Leu Met Glu Asn Val Ser Asp Pro Ser His He Asp Pro Ala His His 
1 5 10 is 

Arg 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Leu Met Glu Asn Val Leu Asp Ser Ser His He Pro Tyr Thr His His 
15 10 15 

Arg 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
TGGGGAACTT GATCGCGCAC GCCTTCGG 28 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
TCGGGCATGG CCTGGGGGAT CTTGG 25 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "pimer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GGCCACGCGT CGACTAGTAC 20 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
GTGCTCGGCT CCGCCTGCTC CGCCGCTTCC CCTGG 35 
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WHAT IS CLAIMED IS: 

1. A substantially purified plant protein which is 
capable of suppressing cell death in plants. 

2. The protein of claim 1, wherein said protein 
5 comprises a Rieske iron- coordinating motif. 

3. The protein of claim 2, wherein said protein also 
comprises a mononuclear iron-binding site. 

4. The protein of claim 1, wherein said protein has 
the amino acid sequence set forth in SEQ ID NOS 1 & 2 . 

10 5. The protein of claim 1, wherein said protein 

contains the carboxyterminal sequence set forth in SEQ ID 
NOS 1 & 2. 

6. An isolated nucleotide sequence which encodes a 
plant protein which suppresses cell death in plants. 

15 7. The nucleotide sequence of claim 6, wherein the 

plant protein comprises an amino acid sequence as set forth 
in SEQ ID NOS 1 & 2. 

8. The nucleotide sequence of claim 6, wherein said 
plant protein has a carboxyterminal sequence as the 

20 carboxyterminal sequence set forth in SEQ ID NOS 1 & 2 . 

9. The nucleotide sequence of claim 6, wherein said 
sequence comprises the sequence set forth in SEQ ID NO: 1. 

10. An isolated nucleotide molecule encoding a 
polypeptide capable of suppressing cell death in plants, 

25 said molecule having a sequence which hybridizes to the 
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carboxyterminal region of the nucleotide sequence of claim 8 
under stringent conditions. 

11. An isolated nucleotide molecule encoding a 
polypeptide capable of suppressing cell death in plants, 
said molecule having a sequence which has at least 70% 
sequence similarity of the sequence of claim 9 . 

12. A transformed plant comprising a chimeric gene, 
said gene comprising a nucleotide sequence which encodes a 
protein which suppresses plant cell death operably linked 
with a promoter which is operable in the plant cell. 

13. The transformed plant of claim 12, wherein said 
protein comprises a Rieske iron-coordinating motif. 

14. The transformed plant of claim 13, wherein said 
protein is a plant protein. 

15. The transformed plant of claim 14, wherein said 
protein comprises an amino acid sequence as set forth in SEQ 
ID NOS 1 & 2. 

16. The transformed plant of claim 14, wherein said 
protein has a carboxyterminal sequence as the 
carboxyterminal sequence set forth in SEQ ID NOS 1 & 2 . 

17. The transformed plant of claim 14, wherein said 
nucleotide sequence comprises the sequence set forth in SEQ 
ID NOS 1 & 2. 

18. Transformed seed from any of the plants of claims 
12-17. 
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19. A method for controlling cell death in a plant, 
said method comprising transforming said plant with an 
expression cassette comprising a chimeric gene, said gene 
comprising a nucleotide sequence which encodes a protein 

5 which suppresses plant cell death operably linked with a 
promoter which is operable in the plant cell . 

20. The transformed plant of claim 19, wherein said 
protein comprises a Rieske iron-coordinating motif. 

21. The method of claim 20, wherein said protein is a 
10 plant protein. 

22. The method of claim 20, wherein said protein 
comprises an amino acid sequence as set forth in SEQ ID NOS 
1 & 2. 

23. The method of claim 20, wherein said protein has a 
15 carboxyterminal sequence as the carboxy terminal sequence 

set forth in SEQ ID NOS 1 & 2. 

24. The method of claim 21, wherein said nucleotide 
sequence comprises the sequence set forth in SEQ ID NO: 1. 

25. A method for increasing resistance to disease in a 
20 plant, said method comprising transforming said plant with 

an expression cassette comprising a chimeric gene, said gene 
comprising a nucleotide sequence which encodes a protein 
which suppresses plant cell death operably linked with a 
promoter which is operable in the plant cell. 

25 26. The method of claim 24, wherein said disease is a 

result of plant pathogens . 
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27. The method of claim 26, wherein said plant 
pathogens are selected from the group consisting of viruses, 
bacteria, insects, and fungi. 

28. The method of claim 27, wherein said fungi is 
5 selected from the group consisting of Drechslera maydis, 

Fusarium moniliforme, Gibberella zeae, and Cochliobolus 
heterostrophus . 

29. The method of claim 25, wherein said protein has 
been modified to decrease protein activity. 

10 30. The method of claim 29, wherein said protein has 

been modified by substitution of amino acids. 

31. The method of claim 30, wherein said substitution 
comprises changing at least one Tyr residue to Ala. 

32. The method of claim 25, wherein said protein 
15 comprises a Rieske iron- coordinating motif. 

33. The method of claim 32, wherein said protein is a 
plant protein. 

34. * The method of claim 33, wherein said protein 
comprises an amino acid sequence as set forth in SEQ ID NOS 

20 1 & 2. 

35. The method of claim 33, wherein said protein has a 
carboxy terminal sequence as the carboxy terminal sequence 
set forth in SEQ ID NOS 1 & 2. 

36. The method of claim 33, wherein said nucleotide 
2 5 sequence comprises the sequence set forth in SEQ ID NO: 1 
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37. An isolated nucleotide sequence comprising a 
promoter sequence which is capable of driving expression of 
a gene in a plant cell wherein said promoter natively drives 
the expression of a plant cell death suppressor protein. 

38. The promoter of claim 37, wherein said promoter 
comprises the sequence set forth in SEQ ID NO: 3. 

39. A chimeric gene comprising the promoter of claim 
37, operably linked with a heterologous coding sequence. 

40. A vector comprising the chimeric gene of claim 39. 

41. A host cell comprising the vector of claim 40. 

42 . A plant which has been stably transformed with the 
chimeric gene of claim 40. 

43. Transformed seed of the plant of claim 42. 

44. A method for increasing transformation efficiency, 
said method comprising transforming a cell with a chimeric 
gene, said gene comprising a nucleotide sequence which 
encodes a protein which suppresses plant cell death operably 
linked with a promoter which is operable in the plant cell. 

45. The method of claim 44, wherein said protein 
comprises a Rieske iron- coordinating motif. 

46. The method of claim 45, wherein said protein is a 
plant protein. 

47. The method of claim 46, wherein said protein 
comprises an amino acid sequence as set forth in SEQ ID NOS 
1 & 2. 
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48. The method of claim 47, wherein said protein has a 
carboxyterminal sequence as the carboxyterminal sequence set 
forth in SEQ ID NOS 1 & 2 . 

49. The method of claim 47, wherein said nucleotide 
5 sequence comprises the sequence set forth in SEQ ID NO: 1. 

50. An isolated nucleotide sequence which comprises a 
sequence encoding a plant protein which suppresses cell 
death in plants. 

51. The nucleotide sequence of claim 50, wherein said 
10 sequence comprises the sequence set forth in SEQ ID NO: 4. 

52. A probe for mapping the presence of a nucleotide 
sequence, wherein said probe comprises a portion of the 
nucelotide sequence of claim 51. 

53. A chimeric gene comprising the nucleotide sequence 
15 of any of claims 6-11 operably linked with a heterologous 

promoter. 

54. A method for producing male sterile plants, said 
method comprising: 

transforming a cell from a plant of interest with an 
20 expression cassette comprising a chimeric gene, said 

chimeric gene comprising a stamen promoter operably linked 
to a modified nucleotide sequence which encodes a protein 
which natively supresses cell death, and 
regenerating a transformed plant. 

25 55. The method of claim 54, wherein said nucleotide 

sequence has been modified to encode a protein which 
exhibits a decrease in activity. 
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56. The method of claim 55, wherein said protein has 
been modified by substitution of amino acid residues. 

57. The method of claim 56, wherein said substitution 
comprises changing at least one Tyr residue to Ala. 
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